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Abstract 



This work is a short, self-contained introduction to subriemannian geometry with 
special emphasis on Chow's Theorem. As an application, a regularity result for the 
Poincare Lemma is presented. 

At the beginning, the definitions of a subriemannian geometry, horizontal vector- 
fields and horizontal curves are given. Then the question arises: Can any two points 
be connected by a horizontal curve? Chow's Theorem gives an affirmative answer 
for bracket generating distributions. (A distribution is called bracket generating if 
horizontal vectorfields and their iterated Lie brackets span the whole tangent space.) 

We present three different proofs of Chow's Theorem; each one is interesting in 
its own. The first proof is based on the theory of Stefan and Sussmann regarding 
integrability of singular distributions. The second proof is elementary and gives 
some insight in the shape of subriemannian balls. The third proof is based on 
infinite dimensional analysis of the endpoint map. 

Finally, the study of the endpoint map allows us to prove a regularity result for 
the Poincare Lemma in a form suited to subriemannian geometry: If for some r > 
all horizontal derivatives of a given function / are known to be r times continuously 
differentiable, then so is /. 

Sections 1 to 3 are the common work of Martin Bauer and Philipp Harms. 
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1 The definition of a subriemannian geometry 



For the basic definitions in differential geometry, see for example the book of Brocker 
and Janich [4]. A more detailed treatment, including calculus in Banach spaces 
and Banach manifolds, can be found in the book of Abraham et al. [1]. A good 
introduction to subriemannian geometry is the book of Montgomery [8] . Throughout 
this paper, manifolds are assumed to be paracompact and smooth. Vectorfields and 
mappings between manifolds are assumed to be smooth if not stated otherwise. 

Definition 1.1. Let M be a manifold. Suppose that for each x G M we are given 
a sub vector space H x of T X M . The disjoint union H = \J_ xeM is called a 
distribution on M. We call tangent vectors in H horizontal. 

Definition 1.2. A distribution H is called regular if the dimension of H x is locally 
constant, and singular otherwise. 

Definition 1.3. Let X(M) denote the set of all vectorfields on M, and let X/ oc (M) 
be the set of local vectorfields, i.e. 



To avoid the need for remarks such as "provided the domains of the vectorfields 
intersect" , we shall declare the vectorfield defined on the empty set to be an element 



Definition 1.4. A local vectorfield is called horizontal if it is horizontal at every 
point where it is defined. Let £#(M) denote the set of horizontal vectorfields and 
£ioc,h(M) the set of local horizontal vectorfields on M. 

Definition 1.5. We say that a set of local vectorfields V C X; oc (M) spans a distri- 
bution H if for all points x G M, H x is the linear hull of the set {X(x) : X G V}. 
We say that a distribution if is a smooth if it is spanned by Xi oc ,h(M). 

Remark 1.6. Every set of vectorfields spans a distribution, which is obviously smooth 
(the linear hull of the empty set is the vector space {0}). 

Definition 1.7. A distribution H is called locally finitely generated if for every point 
x G M there is neighbourhood U of x and a finite set of vectorfields Xi, . . . , G 
Xh(U) spanning H on U . 

Definition 1.8. Let H be a smooth distribution on M. A subriemannian metric 
(•, •) on H is a map that assigns to each point x G M an inner product (•, -) x on 
H x . Let \\X\\ x designate a/ (X, X) x for X G H x . We say that a subriemannian 
metric is smooth if for all local horizontal vectorfields X and Y, x \-t (X(x), Y(x)) x 
is smooth. 

The smoothness of (•, •) does not imply that ||-|| : H — )■ R is continuous, when H 
is endowed with the subspace topology of TM. 




of X loc (M). 
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Example 1.9. To give an example, let H be the distribution on K. spanned by the 
vectorfield x-^, and let (•, •) be such that 

\ox ox / x X 

Then (•, •) is smooth, but is not continuous, where H is given the subspace topology. 
To see that (•,•) is smooth, let Xi,X 2 be horizontal vectorfields defined near zero, 
Xi(x) = fi(x) §- x \ x with fi smooth, fi(0) = 0. Then 

(X 1 (x),X 2 (x)) x = -h(x)f 2 (x) = 1 f ^-{h{xs)f 2 {xs))ds = 

X X Jq O.S 

= [\f[(xs)f 2 (xs) + h{xs)f 2 {xs)) ds e 

To see that || • || is not continuous, take X n = -7= Then X n converges to the 

zero tangent vector at x — 0, but ||X„|| = 1. 

Definition 1.10. A subriemannian manifold is a manifold endowed with a smooth 
distribution and a smooth subriemannian metric on this distribution. 

From now on, we will only consider smooth distributions and smooth subrieman- 
nian metrics. 

Definition 1.11. A curve 7 is called absolutely continuous if it is absolutely con- 
tinuous in any chart, i.e. for any chart (U,u) of M and any interval [t ,ti\ such that 
7([£o, C U, the curve uo7|[t ,£i] is absolutely continuous. 

Definition 1.12. A curve 7 : [a, b] — > M is said to be controlled by local vectorfields 
Xi, ... ,Xk defined near 7([a, &]) if 7 is absolutely continuous and if there are L l - 
functions ui, . . . ,Uk such that the equation 

k 

7(t) = ^^(t)X l ( 7 (t)) 
i=i 

holds almost everywhere, ui, . . . , are called the controls of 7 with respect to the 
vectorfields X ± , . . . ,X k . We also say that the controls steer the point 7(a) to the 
point 7(6). 

Definition 1.13. A curve 7 is called horizontal if it is a concatenation of curves 
that are controlled by horizontal vectorfields. 

Definition 1.14. For a point x G M, the accessible set Acc(x) is the set of all 
points which can be reached by a horizontal curve starting at x. A set S C M is 
called horizontally pathwise connected if any two points in S can be connected by 
a horizontal curve lying in S. 

Lemma 1.15. If 7 is a horizontal curve in a subriemannian manifold, then the 
function t h-> ||7(0ILm ^ s integrable. 
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Proof. A horizontal curve is a concatenation of curves controlled by horizontal vec- 
torfields. If 7 is controlled by horizontal vectorfields X 1 , . . . ,Xk with L 1 -controls 
■ui, ... , Uk, we have 



k k 



l7ll 7(t ) 



E£k(*)lh(*)l (^(7(0),^-(7(*))> 



7(0 



i=i j=i 
k 



<tf(£Mf)i) (£m*)I) = ^|Ei^)i 



. i=l 



where K bounds the continuous functions 



,i=i 



t ^ 



(X i ( 7 (0),A J -( 7 (0)> 7W 



1 < i, j < k. 



□ 



Definition 1.16. If M is a subriemannian manifold, we define the length of a 
horizontal curve 7 : [a, b] — > M by 



*(7)= / 6 ||7(*)ll 7W dt. 

«/ a 



Definition 1.17. The subriemannian distance, also called Carnot-Caratheodory 
distance, between two points x and y on a subriemannian manifold is given by 

tZ(x,j/) = inf /(7), 

where the infimum is taken over all horizontal curves that connect x to y. The 
distance is infinite if there is no such curve. The subriemannian ball of radius e 
centered at x G M is denoted by 

B(e,x) = {y e M : d(x,y) < e}. 

Definition 1.18. A horizontal curve 7 that connects x to y is called a geodesic if 
Z( 7 ) = 



1.1 Accessibility and the set of horizontal curves 

We have defined the accessible set as the set of points reachably by a certain class 
of curves, namely the curves as in Definition 1.13. Theorems 3.20 and 3.66 confirm 
that our choice of admissible curves is a good one. It can be seen in the proofs of 
these theorems that the important point is that we require a curve to be controlled 
by horizontal vectorfields. Bella'iche [3] shows that instead of L 1 -controls, we can use 
any reasonable smaller set of controls without changing the notion of accessibility: 

Theorem 1.19. Let C be a dense subspace in -^ 1 ([0, l];M fe ). Then any point acces- 
sible from x by means of controls in -^ 1 ([0, 1]; M. k ) is also accessible from x by means 
of controls in C . 
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C can even be the set of piecewise constant functions as will be seen in Section 
3.1.2 about orbits. 

It is a different question whether we can enlarge the class of horizontal curves 
without changing the notion of accessibility. For example it would be tempting to 
drop the concept of controllability and instead define a curve to be horizontal if it is 
absolutely continuous and if its derivative lies in H whenever it exists. This works 
out for regular distributions as will be shown in Lemma 1.23 below, but we will see 
in the following example that generally this might change the notion of accessibility. 

Example 1.20. Let H be the distribution on R that has rank zero at points x < and 
rank one at points x > 0. Then H is smooth. It is easy to construct an absolutely 
continuous curve 7 with 7(0) = and 7(2) > for t > such that 7(0) either 
vanishes or does not exist. But such a curve is not controllable by any horizontal 
vectorfield because controllability implies that the curve is constant. To show this, 
let us assume that 7 is controlled by a horizontal vectorfield X. Then 7 satisfies the 
differential equation 

i(t)=u(t)Xfr(t)), 7(0) = 

with u G L l {\Q, By Theorem 3.57 the above differential equation has a unique 
solution for small t. But also the constant curve is a solution to the above equation, 
and therefore 7 is constant. 

Lemma 1.21. A regular, smooth distribution is locally finitely generated, and it is 
a vector subbundle of the tangent bundle. 

Proof. Fix an arbitrary point x G M, and let k be the rank of the distribution 
H near x. Because H is smooth, there are local horizontal vectorfields X 1: . . . ,X k 
such that Xi(x), . . . ,X k (x) span H x . Xi, . . . ,X k are linearly independent also in 
some neighbourhood of x, so they span H in this neighbourhood, and therefore H 
is locally finitely generated. 

Now take arbitrary vectorfields X k+1 , . . . , X n defined near x such that the tangent 
vectors Xi(x) , . . . , X n (x) are linearly independent. Then Xi, . . . ,X n are linearly 
independent also in some neighbourhood of x, and so they form a local frame of 
the tangent space near x. To every tangent vector in the tangent space near x, 
we can associate its coordinates in terms of this local frame. This yields a local 
trivialization for H, and so H is a subbundle of TM. □ 

Lemma 1.22. If (•, •) is a smooth subriemannian metric on a regular smooth dis- 
tribution H , then (•, •) can be extended to a riemannian metric. 

Proof. Locally, we can find a frame Xi, . . . , X n for TM such that X\, . . . X k are hor- 
izontal and orthonormal with respect to (-,•)• On the common domain of definition 
of the vectorfields Xi, we can define a riemannian metric g via 

II for 1 — j. 

To get a riemannian metric defined on all of M, we patch together the obtained 
local riemannian metrics using a partition of unity. □ 
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Lemma 1.23. If H is a regular smooth distribution, then a curve is horizontal if 
and only if it is absolutely continuous and its derivative lies in H whenever it exists. 

Proof. 

(1) To show the one implication, let 7 be a horizontal curve. Then by definition 7 
is a concatenation of curves controlled by horizontal vectorfields. Therefore it 
is absolutely continuous and its derivative lies in H whenever it exists. 

(2) It remains to show the other implication, so let 7 : [a, b] — > M be an abso- 
lutely continuous curve whose derivative is horizontal whenever it exists. Let 
k be the rank of H near 7([a, b]). We can find a = t < ti < ■ ■ ■ < t n = b 
such that 7([ij, U+i]) is contained in some open set Ui, Ui lies in the domain 
of some chart, and on each set Ui there is a set of ortho normal vectorfields 
X it i, . . . ,X i>k G Xn{Ui) spanning H. 7 is absolutely continuous, so by definition 
there are functions v^i G L 1 ([tj, t i+1 \] R) such that for ti <t < t i+1 we have 

n f) 

1=1 1 

We can extend the given subriemannian metric to a riemannian metric and use 
the same symbol (•, •) for it. Then for U < t < t i+ i we have 

k 

3=1 




so on each interval [ti,t i+ i], 7 is controlled by horizontal vectorfields. □ 

2 Basics of differential geometry 
2.1 Vectorfields and flows 

The following basic definitions and Lemmas are taken from Michor [7]. 

Definition 2.1. A local diffeomorphism on M is a C°°-diffeomorphism from an 
open subset U of M onto an open subset V of M. If ipi : Ui — > Vj,(i — 1, 2) are local 
diffeomorphisms, then the composite ipi o (p 2 is a local diffeomorphism, with domain 
y?2 1 (Ui) and range y?i(Vi fl U\). The inverse of </?i is denoted by y?^ 1 , and is a local 
diffeomorphism with domain V\ and range U\. The formal laws 

(Vi V2) V^3 = <Pi (^2 V3) and (Vi V^)" 1 = ^2 1 V 5 ! 1 
are clearly valid. 
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Definition 2.2. A curve of local diffeomorphisms through the identity on M is a 
smooth mapping (p : U — >■ M defined on some open set U C 1 x M such that 

(a) U contains {0} x M and for all points x E M, the set (7nl x {x} is open and 
connected. 

(b) (ft : U fl ({£} x M) ->M, i4 <4>{t,x) is a local diffeomorphism. 

(c) v? = H M . 

</? is called a local flow if in addition, it satisfies 

(d) ip(t, <f(s, x)) = ip(t + s, x) for all t, s, x such that both sides of the equation are 
defined. 



Definition 2.3. The infinitesimal generator of a local flow (p : U — >■ M is the 

atlo' 



vectorfield given by a; H> JU </?(i,x). 



The theory of flows and vectorfields says that for each vectorfield X, there is a 
unique local flow whose infinitesimal generator is X and whose domain of definition 
is maximal with respect to inclusion. 

Definition 2.4. This flow is called the maximal local flow of X and will be denoted 
by Fl x . 

Remark 2.5. As in Definition 2.2, we will use the notations Fl x (t,x) and Fl x (x) to 
denote the same thing. 

Remark 2.6. If X is only a local (and not a global) vectorfield, then its maximal 
local flow is a curve of local diffeomorphism through the identity on the domain of 
definition of X. 

If / : M — > M is a diffeomorphism, then for any vectorfield X E X(M) the 
following mappings are vectorfields as well: 

f*X := Tf' 1 oXof, f,X := Tf o X o 

Definition 2.7. In the above situation, /*X is called the pullback of X along /, 
and f*X the pushforward of X along /. 

Definition 2.8. Let / : M — > N be a smooth mapping. Two vectorsfields X E 
X(M) and Y E X(M) are called /-related, ii Tf o X = Y o f holds, i.e. if the 
following diagram commutes: 

TM TN 

x 

M N 



Lemma 2.9. Let X E X(M) and Y E 3L{M) be f -related vectorfields for a smooth 

if = Fir 



mapping f : M — >■ N '. Then we have f o Fl x = FlJ o / ; whenever both sides are 



defined. In particular, if f is a diffeomorphism, we have Fl{ Y = f 1 o FlJ o f. 
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Proof. We have 



d_ 
dt 



foFl? = Tfo-Fl?=TfoXoFl? = YofoFl> 



x 



and f(Fl x (0,x)) = f(x). So t >->■ f(Fl x (t ) x)) is an integral curve of the vectorfield 
Y on N with initial value f(x), so we have 



Remark 2.10. By the prescription C x (f)(x) : = each vectorfield X G X(M) 

defines a derivation £x on the algebra C°°(M) of smooth functions on M, i.e. a 



mapping £ x : C°°(M) C°°(M) with £ x (f ■ g) = £ x (f) ■ g + f ■ C x {g). This 



defines a vectorspace isomorphism between X(M) and the space of derivations on 
C°°(M). Therefore we can identify a vectorfield X with the derivation C x . We will 
often write X instead of C x when the meaning is clear from the context. 

Definition 2.11. For X, Y G £(M), the Lie bracket [X,Y] G X(M) is the unique 
vectorfield that satisfies the formula C\x,y] — £x ° £>y — £>y ° £>x- The definition of 
the Lie bracket can be extended to arbitrary bracket expressions B(Xi, . . . in 
the obvious way. 

Lemma 2.12. Let f : M ^ N be a smooth map. If X 1 G X(M) is f -related to 
Yi G X(N) andX 2 G X(M) is f -related to Y 2 G X(N), then AiX! + A 2 X 2 is f -related 
to \{Yi + \ 2 Y 2 and [X^X 2 ] is f -related to [Yi,Y 2 ]. 

Proof. The first assertion is immediate. To prove the second, we view tangent 
vectors as derivations on the algebra of germs of smooth functions. Let h be the 
germ of a smooth function on iV at f(x). Then ho f is the germ of a smooth function 
on M at x. By assumption, we have Tf o X^ — Yi o f for % — 1, 2, thus: 



(Xi(x))(hof) = (T x f.Xi(x))(h) = ((TfoX^x))^) = 

= ((Y i of)(x))(h) = (Y l (f(x)))(h). 



(hof ) = C Xl {C X2 (hof))- C X2 (jC Xi (hof)) 

= C Xl {{C Y2 {h)) O /) - C X2 ((C Yl (h)) O /) 

= (C Yl (C Y2 {h))) O / - (Cy 2 (C Yl {h))) O / = O /. 



2.2 Submanifolds 

Definition 2.13. A subset of a manifold M is called a submanifold of M if for 
each p G N there is a chart (U, (p) in M with p G U and n G N such that 



/(F/ X (t, x)) = /(F/ y (t, /(x))) or / o Flf = FlJ o f. 



□ 



So we have C Xi (h of)— [C Yi {h)) o /, and therefore 



But this means Tf o [Xx, X 2 ] = [y ls F 2 ] o /. 



□ 



<p:U ->■ M n+fe =l"xl fc and <^(A n C/) = y?(t/) n (M n x {0}) . 
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One of the most important tools for dealing with submanifolds is the Constant 
Rank Theorem. Proofs of the Constant Rank Theorem can be found in the books of 
Brocker and Janich [4] and Abraham et al. [1]. In Section 3.3, we will give a proof of 
a generalization of this theorem to Banach spaces, but in finite dimensions, it takes 
the following form: 

Theorem 2.14 (Constant Rank Theorem in finite dimensions). Let U C lR m be 

an open set in W 11 . Let f : U — >■ lR n be of class C r , r > 1, and assume that f'{u) 
has constant rank k in a neighbourhood of uq G M m . Then there exist open sets 
U U U 2 C R m and V u V 2 C R n and C r -diffeomorphisms up : V x -> V 2 and ip : Ui ->• U 2 
such that (p/o ^(x 1 , . . . , x m ) = (xi, . . . , Xk, 0, . . . , 0). 

Definition 2.15. An immersion is a smooth map between manifolds such that its 
differential is injective at every point. If i : N — y M is an injective immersion, then 
(N, i) is called an immersed submanifold of M. 

Remark 2.16. An immersion has constant (full) rank at every point. So if we look 
at an immersion in a chart, it is a map as required in the assumptions of Theorem 
2.14. Therefore, in the right charts, an immersion looks like the map (xi, . . . ,x m ) h- >■ 
(xi, . . . ,x m ,0, . . . ,0). 

The following definitions and lemmas concerning initial submanifolds are taken 
from Michor [7]. 

Definition 2.17. For an arbitrary subset A of a manifold M and Xq G A, let C X0 (A) 
denote the set of all x G A which can be connected to x by a smooth curve in M 
lying in A. 

Definition 2.18. Let N be a subset of a manifold M. A chart (U, u) of M centered 
at x G iV is called an initial submanifold chart for iV if 

u(C x (U n iV)) = u{U) n (R n x {0}) for some n G N. 

iV is called an initial submanifold of M if for all x G N, there is an initial 
submanifold chart centered at x. 

Definition 2.19. We will call a mapping % : N — >■ M initial if the following condition 
holds: For any manifold Z a mapping / : Z — > N is smooth if and only if % o f : 
Z — > M is smooth. 

Remark 2.20. Let iV be a submanifold of M. Then the injective immersion % : iV — y 
M is an initial mapping, and iV is an initial submanifold of M. 

Lemma 2.21. Let i : N —y M be an injective immersion and an initial mapping. 
Then i(N) is an initial submanifold of M. 

Proof. Let n denote the dimension of iV and m the dimension of M. Let x G N. 
As noticed in Remark 2.16, because i is an immersion, we may choose a chart (V, v) 
centered at i(x) on M and another chart (W,w) centered at x on iV such that 

(votow~ 1 )(y\...,y n ) = (y 1 ,...,y n ,0,...,0). 
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Let r > be so small that 

{zeR m : \z\ < 2r} C v(V) and {y G R n : \y\ < 2r} C w(W). 

Put 

V x := v-\{z G R m : \z\ < 2r}) C N and Wi := G M n : |y| < 2r}). 

We claim that (Vi,v) is an initial submanifold chart. 

v-^viV,) n (M n x {0})) = ^({(y 1 , . . . , y n , 0, . . . , 0) : \y\ < r}) = 
= io W - 1 o(voio uT 1 )- 1 ^ 1 , . . . , y n , 0, . . . , 0) : \y\ < r}) = 
= % o w~\{y G R n : \y\ < r}) = C n i(iV)). 

The last inclusion holds since i(Wi) C V\ C\i(N) and is smoothly contractible. 

To complete the proof, it remains to show the other inclusion C^iVi fl i(N)) C 
i(Wi). Let z G Ci( x )(V\ fl i(N)). Then by definition, there is a smooth curve 
c : [0, 1] — > M with c(0) = c(l) = z that lies entirely in V\ fl i(iV). Because 
i is an initial mapping, the unique curve c : [0, 1] — > with i o c = c is smooth. 
We claim that c lies entirely in W\. If not, then there is some t G [0, 1] such that 
c(t) G G 1R™ : r < \y\ < 2r}) since c is smooth and thus continuous. But 

then we have 

(v o c )(t) = (u o i)(c(t)) G (v o i o G M n : r < < 2r}) = 

= {(y, 0) G (M Tt x {0}) : r < \y\ < 2r} C {z G M m : r < \z\ < 2r}. 

But (v o c)(t) G {-2 G lR m : r < \z\ < 2r} means c(t) ^ Vi, a contradiction. So c lies 
entirely in Wi, thus z = c(l) = i(c(l)) G □ 

Lemma 2.22. Let N be an initial submanifold of M . Then there is a unique smooth 
manifold structure on N such that the injection i : iV — >■ M is an injective immersion 
and an initial mapping. 

Proof. Uniqueness follows directly from the definition of an initial mapping. To 
prove existence, we claim that the charts {C X {U X fl N),u x ), where (U x ,u x ) is an 
initial submanifold chart for N, yield a smooth manifold structure for iV such that 
% has the desired properties. 

The sets C X (U X fl N) might not be open in the subspace topology of N, so in 
general, the subspace topology of N is not the same as the chart topology. But we 
know that the chart topology is finer than the subspace topology, and therefore it is 
Hausdorff. The chart changings are smooth since they are just restrictions of chart 
changings on M. Clearly, % : N — > M is an injective immersion. 

We need to show that i is an initial mapping. For z G Z, we choose a chart (U, u) 
centered at f(z), such that u(C f{z) (U n N)) = u{U) n (R n x {0}). Then / _1 (C/) is 
open in Z and contains a chart (V,v) centered at z with v(V) a ball. Then f(V) is 
smoothly contractible in UnN, so f(V) C C f{z) (UnN), and (M|C , /(z) (f/nA^))o/ot;- 1 
is smooth. □ 

Remark 2.23. Let iV be an initial submanifold of M. Then the connected com- 
ponents of iV are separable because M admits a riemannian metric which can be 
induced on iV (see Abraham et al. [1, Section 5.5]). However, may have uncount- 
ably many connected components. 
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3 Chow's Theorem 



Definition 3.1. A set of local vectorfields V C Xi oc (M) is called involutive if for 
any 1,7 GV, also [X, Y\ G V. We will call a distribution H involutive if the set of 
local horizontal vectorfields 3Ci OCj h is involutive. 

Definition 3.2. For a set of local vectorfields V C Xi oc (M), the Lie hull £(V) of 
V is the smallest involutive set of local vectorfields which contains V. V is called 
bracket generating if C(V) spans TM. A smooth distribution H is called bracket 
generating (satisfies Hormander's condition) if the set of locally defined horizontal 
vectorfields Xi c,h{M) is bracket generating. 

Theorem 3.3 (Chow's Theorem). If H is a bracket generating distribution on a 
connected manifold, then any two points in the manifold can be connected by a 
horizontal path. 

The converse of Chow's Theorem fails: It is possible that any two points can 
be connected by a horizontal path, but that still the distribution is not bracket 
generating. In Section 3.1 we will see that this is the case exactly if S(%i oc> h(M)) 
spans TM, but jC(Xi oc ,h(M)) does not. 

Example 3.4. The following very easy example for this situation can be found in the 
paper of Nagel et al. [9] and in the book of Jurdjevic [5, page 65]. For an other, 
more elaborate example see Montgomery [8, page 24]. Let H be the distribution on 
IR 2 that is spanned by the two vectorfields 

X 1 = — , X 2 = 0— , 
ox ay 

where is a C°°-function on IR 2 that vanishes on the left-hand plane x < and 
is positive otherwise. Then any two points in the plane can be connected by a 
horizontal curve. But H is not bracket generating because for x < 0, H x is one- 
dimensional and therefore all the Lie brackets vanish. 

Figure 1 shows that in the above example, one encounters irregularities in the 
shape of the subriemannian ball B(e,x) as e increases. The Ball-Box Theorem 3.23 
and also Theorem 3.67 show that these irregularities are ruled out in the case of 
a bracket generating distribution. So non-bracket generating distributions are not 
desirable from a geometric point of view. 



x 



x 



x 



Figure 1: B(e, x) for x = (—1,0) G R 2 and e = |, e = 1, e 



3 
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We will give three different proofs of Chow's Theorem in Sections 3.1, 3.2 and 
3.3, respectively. 
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3.1 First proof of Chow's theorem 

This proof works under the most general assumptions compared to the proofs given 
in Section 3.2 and 3.3. It is based on the theory of Stefan [10] and Sussmann [11]. 
However, the notation and the proofs are taken from the lecture notes of Michor [7] . 
The outline of the proof is the following: The Stefan-Sussmann-theory asserts that 
the accessible set is a maximal integral manifold of a certain integrable distribution 
containing H. This will be shown in Theorem 3.20. The bracket generating condition 
implies that this distribution is equal to TM. Because M is connected, it is the 
maximal integral manifold of this distribution. Using Theorem 3.20, the proof of 
Chow's theorem is only a few lines long; it can be found at the very end of this 
section. 

3.1.1 Integral manifolds 

Definition 3.5. An integral manifold of a distribution if is a connected immersed 
submanifold (N,i) such that T x i(T x N) = H^ x ) for all x G N. An integral manifold 
of H is called maximal, if it is not contained in any strictly larger integral manifold 
of H. A distribution H is called integrable if for any x G M, there is an integral 
manifold of H containing x. 

Remark 3.6. Let if be a distribution on a connected manifold M. Then M is a 
maximal integral manifold of H if and only if H — TM. 

Lemma 3.7. Let H be a smooth distribution on M . Then we have: 

(1) If(N,i) is an integral manifold of H and X G %ioc,h(M), then i*X makes sense 
and is an element ofXi oc (N), which is i\i~ 1 (Ux) '-related to X, where Ux C M 
is the open domain of X . 

(2) If {Ni.ii) and (N 2 ,i 2 ) are integral manifolds of H , then %{ 1 (ii(Ni) Hi 2 (N 2 )) and 
i 2 1 (ii(N 1 ) fl i 2 (N 2 )) are open subsets in Ni and N 2 , respectively; furthermore 
i 2 1 o i 1 is a diffeomorphism between them. 

(3) If x G M is contained in some integral manifold of H, then it is contained in a 
unique maximal one. 

Proof. 

(1) If i(x) G U x for x G N, we have X(i(x)) G H i(x) = TJ{T X N), so i*X(x) := 
((Txi)' 1 o X o i)(x) makes sense. It is defined on an open subset of iV and is 
smooth in x. 

(2) Let X G Xioc,h(M). Then i*X G Xi oc (Nj) and is ^-related to X for j = 1,2. 
By Lemma 2.9 we have 

ijoF(} X = Flfoij. 

Now choose Xj G Nj such that ii(xi) = i 2 (x 2 ) = x G M and choose vectorfields 
Xi,...,Xk G Xioc,H such that {Xi(x), . . . , Xk(x)} form a basis of H x . Then 

/,-(*!,...,**) :=(ff o...oFf)fe) 
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is a smooth mapping denned near zero R k — > Nj. Since 
j = 1,2, we see that /j is a diffeomorphism near zero. Finally we have 



fj = i*Xi for 

o 



- 1 oFl^o...oFlf k k oi 1 )(x 1 ) 

= (F% Xl o...oFf* Xh oi?oi 1 )(x 1 ) 
= h{t\, ■ ■ ■ , tk)- 

So %2 1 o is a diffeomorphism, as required. 

(3) Let N be the union of all integral manifolds containing x. Choose the union 
of all the atlases of these integral manifolds as atlas for N, which is a smooth 
atlas by (2). Note M admits a riemannian metric which can be induced on 
N, so N is separable (see Abraham et al. [1, section 5.5]). □ 

Remark 3.8. If H is an integrable distribution on a manifold M, then by Lemma 
3.7 each point is contained in a unique maximal integral manifold. These maximal 
integral manifolds form a partition of M. This partition is called the foliation of 
M induced by the integrable distribution H, and each maximal integral manifold is 
called a leaf of the foliation. 

Definition 3.9. A set of local vectorfields V C Xi oc (M) is called stable if for all 
X, Y G V and for all t for which it is defined, the local vectorfield (Flf)*Y is 
again an element of V. Let «S(V) denote the minimal stable set of local vectorfields 
containing V. 

Lemma 3.10. Let V C X/ oc (M) be a set of local vectorfields. Then S(V) consists 
of all local vectorfields of the form (Fl* 1 o • • • o Flf k k )*Y for X u Y E V. 

Proof. We have to show that the set of vectorfields of the above form is stable. By 
Lemma 2.9, the flow of such a vectorfield is 

Fl\ 11 tfcj =Fl\o...Fl\oFlT oFl^o..-oFlf k \ □ 

Definition 3.11. A local vectorfield X e Xi 0C (M) is called an infinitesimal auto- 
morphism of a distribution H if T x (Fl x )(H x ) C H Fl xi tx \ whenever defined. Let 
aut(if) denote the set of all infinitesimal automorphisms of H. 

Lemma 3.12. aut(if) is stable. 

Proof. Let X, Y G aut (if), and let Z := (Fl*)*Y. Then 

T x {Flf){H x ) = T x (Fl x ti o FlJ o Fl x ){H x ) C H Fl z^ x) . □ 

Theorem 3.13 (Stefan-Sussmann). Let H be a smooth distribution on a manifold 
M . Then the following conditions are equivalent: 

(1) H is integrable. 
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(2) %i oc ,h(M) is stable. 

(3) There exists a subset V C X; oc (M) such that S(V) spans H . 

(4) aut(#) n %ioc,h{M) spans H. 
Proof. 

(1) =4> (2) Let X G Xioc,h(M) and let L be the leaf through x G M, with i : L ^ M 

the inclusion. Then F/* o i = i o Fl'lf by Lemma 2.9, so we have 

T x {Fl* t ){H x ) = T{Fl x _A.T x i.T x L = T(Fl x t o i).T x L 
= Ti.T x {Ff?).T x L 
= Ti.T Fl i* X (_ ttX -)L = H Fl x^_ tx y 

This implies that {Fl x )*Y G X loCtH {M) for any Y G X loCtH {M). 

(2) => (4) In fact, (2) says that X loCiH C aut(iJ). 

(4) (3) We can choose V = aut(#) n Xi OC)H (M): for X,F G V, we have 
(FZf )*y G Xi OCiH (M); so V C S(V) C Xi oc ,^'(M) and H is spanned by V. 

(3) =^> (1) We have to show that each point x G M is contained in some integral 

submanifold for the distribution H . Let dimH x = n. There are X 1; . . . , X n G 
«S(V) such that Xi(x), . . . ,X n (x) span H x . As in the proof of Lemma 3.7 (2), 
we consider the mapping 

/(fi,...,t„) := {Fl£ o . . . o Fl£)(x) 

defined near zero in W 1 . Since the rank of / at zero is n, the image under / 
of a small open neighbourhood of zero is a submanifold N of M. We claim 
that iV is an integral manifold of H. The tangent space T/(ti,...,t„)^V is linearly 
generated by the the tangent vectors 

£- k (Fl* o . . . o Fl£)(x) = 

= T(Fl* o ■ ■ ■ o Ft£?)X k ((Fl*£ o ■ ■ ■ o Flf : ){x)) 

= ((Fi\y...(Fi x ^ i yx k )(f(t 1 ,...,t n )). 

Since S(V) is stable, these vectors lie in Hf(t u .. mj t„)- They even span Hf( tu ...,t n ) 
because dim Hf^ = dimH x = n, as we will prove now. Indeed, since «S(V) 
spans H and is stable, T x (Flf) is a vector space isomorphism from H x onto 
HFi x (t,x) fc> r ah X G «S(V), and the conclusion follows from the form of /. □ 

Theorem 3.14. Let H be an integrable distribution of a manifold M . Then for each 
x G M there exists a chart (U,u) around x with u(U) = {y G W 11 : \y l \ < e for all i} 
for some e > 0, and a countable subset A C IR m-n 7 such that for the leaf L through 
x we have 

u(U H L) = {y G u{U) : {y n+ \ . . . , y m ) G A}. 

Each leaf is an initial submanifold. If furthermore the distribution has locally con- 
stant rank, the theorem holds with the same n for each leaf L. 
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Definition 3.15. The chart (u,U) of the above theorem is called a distinguished 
chart for the distribution or the foliation. A connected component of U fl L is called 
a plaque. 

Proof of Theorem 3.14- Let L be the leaf through x, dimL = n. Let X±, . . . , X n G 
%Ioc,h(M) be local vectorfields such that Xi(x), . . . , X n (x) is a basis of H x . We 
choose a chart (V, v) centered at x G M such that the vectors 



d 

X ± (x), . . .,X n (x), 



dv n + 



d 



form a basis of T X M. Then 

/fa, ...,*„):= (FZf 1 o ■ ■ ■ o ^(^(O, . . . , 0, T+\ . . . , t m ) 

is a diffeomorphism from a neighbourhood of zero in M. n onto a neighbourhood of x 
in M. Let ([/, u) be the chart given by suitably restricted. We have 

t,GL^(n>-oFif;)(t/)a 

for all ti, . . . , t n and all y for which both expressions make sense. So we have 

/(*!, /(o, . . . , o, ...,r)Gi, 

and consequently L fl [/ is the disjoint union of connected sets of the form {y G {/ : 
(u n+1 (y), . . . ,u m (y)) = constant }. Since L is a connected immersed submanifold 
of M, it is second countable and only a countable set of constants can appear in the 
description of U flL given above. We see that u{C x {LC\U)) = u(U)n(R n x {0}), so we 
have found initial submanifold charts for L, and therefore L is an initial submanifold. 
According to the proof of Lemma 2.22, the charts that make (L,i) an immersed 
submanifold with the property that % : L — > M is an initial mapping are given by 
constraining u to C X {L fl U). However, in Lemma 3.7, L as an integral manifold has 
already been given a smooth manifold structure. We will see immediately that the 
two structures are the same: u\C x (L fl U) is the inverse of the map (ti, . . .t n ) h- >■ 
(Fl* 1 o • • • o Flf n ")(x), and therefore u\C x (L fl U) is just one of the charts given to 
L in Lemma 3.7. 

The arguments given above are valid for any leaf of dimension n meeting U, so 
also the assertion for an integrable distribution of locally constant rank follows. □ 



3.1.2 Orbits and integral manifolds 

Definition 3.16. A pseudo-group of local diffeormorphisms is a set G of local 
diffeormorphisms such that for each if,ip e G, we have 

if o ip G G and (p^ 1 G G, 

where ip o ijj and are as in Definition 2.1. For V C %i c,h(M), let Gy denote 
the smallest pseudo-group of local diffeomorphisms which contains the flows of all 
vectorfields in V. 
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Remark 3.17. The elements of Gy are precisely the maps of the form 

Fl* 1 0...0 Flfr, withX.G V. 

Definition 3.18. Let G be a pseudo-group of local diffeomorphisms on M. Two 
points x,y G M are called G-equivalent if there is a ip G G such that (p(x) = y. This 
defines an equivalence relation on M. Its equivalence classes are called the orbits of 
G or G-orbits. 

Theorem 3.19. Let V C X/ oc (M) and x G M. Then the G^-orbit through x is the 
maximal integral manifold through x of the distribution spanned by S (V) . 

Proof. The distribution spanned by «S(V) is integrable because of the Stefan-Sussmann 
Theorem 3.13. Let L be the leaf through x of the distribution spanned by «S(V), 
and let O be the Gy-orbit containing x. 

(1) To show L C O, we will prove that any two points in L are Gy-equivalent. By 
the part (3) =>- (1) of the proof of the Stefan-Sussmann Theorem 3.13 we know 
that the map 

f(h,...,t n ) :=(Fl*o--.oFl£)(y), 

where X; L G «S(V) and y G L, is a diffeomorphism from an open connected 
neighbourhood of zero in W 1 onto an open subset U of L. Because of Lemma 
3.10 and Lemma 2.9, Fl^ is the concatenation of flows of vectorfields in V. 
Therefore by definition any point in U is Gy-equivalent to y. Because y was 
chosen arbitrarily, every point y G L has a neighbourhood whose points are all 
Gy-equivalent. So the Gy-equivalence classes in L are open in L and form a 
partition of L. Because L is connected, all but one of these equivalence classes 
are empty, and therefore all points in L are Gy-equivalent. 

(2) It remains to show the other inclusion O C L. Let y G O, i.e. 

y = Fl* 1 o • • • o Flf k k (x) with I.eVc S(V). 

Then y G L because L is stable under the flows of vectorfields in <S(V). This 
follows from Lemma 2.9 applied to the pullback i*X of a vectorfield, where 
% : L — > M is the inclusion. □ 

3.1.3 Accessible set and integral manifolds 

Theorem 3.20. Let H be a regular distribution on M and x G M . Then the acces- 
sible set Acc(x) is the leaf through x of the distribution spanned by 5(X/ OCj #(M)). 

Proof. Let E be the distribution spanned by »S(X; OCi ij(M)). Then E is integrable 
because the Stefan-Sussmann Theorem 3.13. Let L be the leaf through x of the 
distribution E. 

(1) To show L C Acc(x), we have to show that L is horizontally pathwise con- 
nected. This is a consequence of Theorem 3.19 with V = £/ OCj #(M) because any 
%ioc,H (M)-orbit is of course horizontally connected. 
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(2) To show the other inclusion Acc(x) C L, we have to show that a horizontal path 
starting in L remains in L. In fact, it suffices to show that a path controlled 
by horizontal vectorfields remains in L. So let 7 : [0, 1] — >■ M be an absolutely 
continuous path such that 

k 

7(*) = E«;(*)*;(7(*)) 
3=1 

with /^-functions Uj. We can pull the vectorfields Xj back to L along the 
inclusion i : L — > M and look at the differential equation 

k 

7(f) = $>(0 (^)(7(0), 7(0) = i-\x) 
i=i 

on L. By restricting the controls to some smaller interval [0,e], we can make 
their L 1 -norm so small that we can apply Theorem 3.57, which gives us a solution 
7 : [0, e] — > L. By uniqueness, 7 and 7 coincide on [0, e]. From these arguments 
we see that 7 locally remains in L. Therefore the set {t G [0, 1] : 7(f) G L} is 
open and non-empty. But also the set {t G [0, 1] : 7(f) ^ L} is open for the 
same reasons. Because [0, 1] is connected, one of these sets must be empty, and 
therefore 7 remains in L. □ 

3.1.4 Proof of Chow's Theorem 

Lemma 3.21. An integrable distribution is involutive. 

Proof. Let H be an integrable distribution; we have to show that 3ii oc ,H is involutive. 
Take vectorfields X, Y G 3l,ioc,h defined near x G M, and let (N, i) be an integral 
manifold of H near x. By Lemma 3.7.1, i*X and i*Y are local vectorfields on N, 
and they are i-related to X and Y, respectively. By Lemma 2.12, also [i*X,i*Y] is 
i-related to [X, Y\. Therefore [X,Y] is horizontal. □ 

Theorem 3.22 (Chow's Theorem). If H is a bracket generating distribution on 
a connected manifold, then any two points in the manifold can be connected by a 
horizontal path. 

Proof. Let E denote the integrable distribution spanned by S(Xi oc ,h(M)), and let 
x G M. By the bracket generating condition and Lemma 3.21, E = TM. M is 
connected, so M is the leaf through x of E. By Theorem 3.20 Acc(x) also is the leaf 
through x of E, so Acc{x) = M. □ 

3.2 Second proof of Chow's Theorem 

This proof works only for regular distributions. The idea of the proof comes from 
Montgomery [8]. He derives Chow's theorem from the Ball-Box Theorem, which is 
interesting in itself. 
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Theorem 3.23 (Ball-box Theorem). Let H be a regular bracket generating distribu- 
tion on an n- dimensional manifold M , and let xq G M. Then there exist coordinates 
yi, . . . ,y n centered at x and positive constants c < C and eo > such that for all 
e < eo, 

Box™(ce) C B(e,x ) C Box™(Ce). 

Here, w is some vector in N n called the weighting associated to the growth vector at 
xq and Box w (e) is the w-weighted box of size e given in coordinates by the expression 

Box w (e) = {y E R : \ Vi \ < e w % i = 1, . . . , n} . 

To prove Chow's Theorem it is sufficient to show that the subriemannian ball 
B(e,xo) is a neighbourhood of xq, and this is all we will prove here. The reason is 
that we were confronted with technical difficulties in the proof given in the book of 
Montgomery [8]. We refer to Nagel et al. [9] for the full proof and a much longer, 
but very nice and complete treatment of the subject. 

The outline of the proof presented in this work is the following: Locally, some 
horizontal vectorfields X±, . . . ,X k together with a selection of their brackets span 
the tangent space. To these vectorfields correspond the flows Fl t 1 , . . . , Fl t k and a 
selection of brackets of these flows. It will take some time to calculate the first non- 
vanishing derivative at t — of the brackets of flows, but this is done in Theorem 
3.31: The first non- vanishing derivative at t — of a bracket of Flf 1 , . . . , Flf k is 
the same bracket applied to X\, . . . ,Xk- In Theorem 3.38 this result will allow us 
to construct a C 1 -diffeomorphism ip from the concatenation of the above flows and 
their brackets. Finally the openness of B(e,Xo) follows from the openness of ip. 

Definition 3.24. For sets of local vectorfields V, W C X/ oc (M), we define 

[V, W] = {[X, Y] : X e V, Y e W} and 

V + w = {X + Y ■. X eV,Y eW} . 

For a set of local vectorfields V, we inductively define the subsets 

Vi = V, V k+1 = V k + [V,V k }. 

Definition 3.25. Let if be a smooth distribution. Then the distribution spanned 
by Xi oc , H (M) k will be denoted by H k . 

Definition 3.26. Let C°°(M) denote the set of smooth functions from M to W. For 
a set of locally defined vectorfields V, let 

span coo(M) V = [J2 u i x i ■ u i e C°°(M),X, e v} , 

where the number of summands in the above sum has to be finite. 
Lemma 3.27. Let V be a set of locally defined vectorfields. Then 

(span C oc (M) V) k = span Coo(M) (V fc ). 
Particularly, ifV spans H , then V k spans H k . 
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Proof. Let W = span C00 ( M )(V). 

(1) span C oo( M )(Vjfc) C Wfc: Clearly, V C W, so we have Vk C V\4, and 

span Coo(M) (Vfe) C span coo(M) (V\4) = W fe . 

The last equality holds because W k is a C 00 (M)-module, as will be shown right 
now. Indeed we claim that if W is a C ,oc (M)-module, then W k is a C°°(M)- 
module, too. This is trivial for k — 1. For > 1 and X G Wfc+i, we can 
write 

X — Y + [Z, W] G Wfc + [W, Wfe]. 
Let / G C°°(M). Then 

fX = fY + (Wf)Z + [/Z, W] G Wfc + [W, W fc ] = W fc+ i, 

and this proves our claim. 

(2) Wfc C spa,n C00 f M \(Vk)' This can be proven by induction. For k — 1, the state- 
ment is trivial. For the inductive step, let X G Wfe+i = W k + [W,V\4]. By 
the inductive assumption, W k C span CO o( M \(Vfc). Therefore, with X i G Vk, 
Yi G V, G Vfe, we can write 

G span C oo (M) (Vfc + i). □ 

Lemma 3.28. If H is a bracket generating distribution, then locally, there is an 
r G N such that H r = TM. 

Proof. Let n be the dimension of M. For x G M, we find vectorfields X±, . . . , X n G 
>C(X; OC) ii-(M)) defined near x such that Xi(x), . . . ,X n (x) form a basis of T X M. Be- 
cause Xi, . . . ,X n are linearly independent at x, they are also linearly independent 
in some neighbourhood of x. Furthermore, because 

C(X loc , H (M)) = Y,XiocH(M) k , 

feeN 

every Xi lies in some Xioc,H(M)kf Therefore, they all lie in Xioc,H(M) r , where 
r = max{/cj : i — 1, . . . , n}. Consequently, H r = TM near x. □ 

Definition 3.29. For a bracket generating distribution such that H r = TM, we 
define the growth vector n(x) G W at x G M to be the vector whose components 
ni(x) are the rank of the distribution Hi at x. (Obviously n(x) may vary from point 
to point.) 

The following theorem and its proof is taken from a paper of Mauhart and 
Michor [6]; it describes the correspondence between brackets of flows and brackets 
of vectorfields. 
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Definition 3.30. For curves of local diffeomorphisms (p and ip, we define the bracket 
of local diffeomorphisms 

[<P, i>] (*, X) = (^Jt 1 O if' 1 O^O <j> t ) ( X ) 

for all (t, x) where the equation is well-defined. As for Lie brackets of vectorfields, 
we extend this definition in the obvious way to arbitrary formal bracket expressions. 

Theorem 3.31. For i = 1 ... k, let ip 1 be curves of local diffeomorphisms through 
the identity. Let Xi be their infinitesimal generators. Then for each formal bracket 
expression B of length k we have 



= * 
dt l 



Bfa,...,^) forl<Kk, 



o 

1 d k 

B(X 1 ,...,X k ) = - — 



In the above theorem, B(Xi, . . . ,X k ) clearly is a vectorfield, but what about the 
k-th derivative of B((p], . . . , <p k )l To explain this, we need the following lemma: 

Lemma 3.32. Let c : R ->■ M be a smooth curve. If c(0) = x E M, c'(0) = 
0, . . . , c( fe_1 )(0) = ; then c^ k \0) is a well-defined tangent vector in T X M which is 
given by the derivation /•->■(/ o c)( fc )(0). 

Proof. We have to show that the Leibnitz rule holds for the map / \-¥ (/ o c)^(0). 
((/■5)ocf(0) = ((/oc).( S o C )f) = 

= E(*)(/°c) (, ' ) (0).((/o C )^)(0) = 

3=0 

= (/oc) W (0) 5 (g) + /(g) (ffoc)«(0) 

since all other summands vanish: (/ o c)^ (0) = for j — 1, . . . ,k — 1. □ 

Lemmas 3.33, 3.34 and 3.35 constitute the proof of Theorem 3.31. In the follow- 
ing, we will use the notation d k — J^. 

Lemma 3.33. Let {p,ip be curves of local diffeomorphisms through Idj^ and let 
f e C°°(M). T/ien we have 



3=0 



For I curves of local diffeormphisms ip 1 , . . . ip l through Hm the multinomial version 
of this formula also holds: 

d?\ {v\ o ■ ■ ■ o <p\yf = jr—p^lo (<dT) ■ ■ ■ (^lo M m )f- 

h+-+jl=k Jl Jl ' 
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Proof. We will only show the binomial version. For a function h(t, s) of two variables 
we have 



#mm) = £ ( k )didt j h(t, S ) 

3=0 



t=s 



since for h(t, s) = f(t)g(s) this is just a consequence of the Leibnitz rule, and linear 
combinations of such decomposable tensors are dense in the space of all functions of 
two variables in the compact C°°-topology, so that by continuity the formula holds 
for all functions. In the following form it implies the lemma: 

#| /(¥>(*, iK*,*))) = £ ( ■) d k -idif{y{t^{s,x)))\ M . □ 

3=0 \ J// 

Lemma 3.34. Let ip be a curve of local diffeormphisms through Mm with first non- 
vanishing derivative 

k\X = d k t \^t- 

Then the inverse curve of local diffeomorphims (p~ l has first non-vanishing derivative 

-k\X=d k t \^ 1 . 
Proof. We have y^ 1 oip t = Mm, so by Lemma 3.33 we get for 1 < j < k 

o = ^i (^r 1 o^)v = E(0(^lo(^)(^"1o(^r 1 r)/ 

i=0 

= ^'lo^wrz+w^'io^r 1 )*/ 

as required. □ 

Lemma 3.35. Let ip be a curve of local diffeormphisms through 1<1m with first non- 
vanishing derivative ml X = d™\ Q (p t , and let ip be a curve of local diffeormphisms 
through Hm with first non-vanishing derivative n\ Y — d^\ ip t . Then the curve of 
local diffeomorphisms [ip, ip] has first non-vanishing derivative 

(m + n)\ [X,Y] = dT +n \ M]t- 
Proof. By the multinomial version of Lemma 3.33 we have 



A N f := dfl^o^o^o^ff 

N\ 
iljlkW 



£ tltL^^Io i^TMl (^TMl W)(%\ W)f- 



i+j+k+l=N 

Let us suppose that 1 < n < m, the case m < n is similar. If N < n all summands 
vanish. If N = n we have by Lemma 3.34 

A N f = (ari (vtT)f + (ario W)f + (%\ ^t l T)f + (^ fc | (Vr 1 )*)/ = o. 
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If n < N < m we have, using again Lemma 3.34 

N\ 
j!Z! 



= (9f| (^- 1 °^D/ + o = o. 

Now we come to the difficult case m,n < N < m + n. By Lemma 3.33 and since all 
other terms vanish we get 

A N f = dfi^o^o^yf 

+ (^) (Tlo tf)(dr m \ i^ 1 ° ^ o U)f + (a? \ tpt)f. 

By Lemma 3.33 again we get 



j\k\l\ 



E ^(^lo i^TMl (^TMl {^T)f 

j+k+l=N • 



E (^)(%\ (^y)(di\ (^ 1 y)f+ (^)(dr m \ (^y)(dr\ (^ 1 y)f+ 



j+l=N 



(dr m \ iU)m\ {-Xf) + Q (ar| (^)>! (-^/) + 31 (v^r 1 )*/ 



m 

= C(» + n)! (XF - YX)f + 9f | Q (v^r 1 )*/ 
From the second expression above one can also read off that 

dr m \ i^ 1 o ^ i ° Ayf = or m \ ((^rf- 

If we put the last conclusions together we get, using Lemma 3.33 and Lemma 3.34 
again: 

A N f = 6» +n (m + n)\ [X, Y]f + d?\ Q {^)* f 

+ Q (dr\o rf)(dr m \ (^If + (d?\ <P*t)f 

= C(» + n)\ [X, Y]f + (31 o <p t )*f 

= 8Z +n (m + n)\ [X,Y]f. □ 

We will use the following two lemmas in our proof of Chow's Theorem. 

Lemma 3.36. Let if : (— e, e) — > M be a smooth curve whose derivatives at zero of 
order up to and including n — 1 are zero, i.e. <// (0) = 0,...,v? (ri_1) (0) = 0. Then 
is a ^-function on [0, e) whose first derivative at zero equals ^y? (n) (0). 
If n is odd, it is a ^-function on (— e, e). 
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Proof. This is a local question, so we can work in a chart and assume without 
loss of generality that M = W 1 and tp(0) = 0. First, we will look at t > only 
and show differentiability from the right at t — using the Taylor formula tp(t) = 

^W(o) + (r). 

lim- L(Vt)-(f(0)) lim ' ( ' -'"•(()) • o(f)^ - 7 " !(U) 



t\o t \ J t\o t \n\ J n\ 



Now, we show that the derivative is continuous from the right at t — 0. We use the 

'(^T)T 



Taylor formula for <// and get (p'(t) = ip( n \0) + o{t n r ), and therefore 



d nr n r t (1/n)_1 

lim— if(vt) = limip'(yt) = 

t\odt *\o n 

= lmV n) (0)-^ — — + o Vt = - , . 

t\(r v ' (n - 1)! n n n\ 

If n is odd, all the above calculations can be done for t < as well. □ 

Lemma 3.37. Let y?i,y?2 : (— e, e) M be smooth functions whose derivatives at 
zero of order up to and including n — 1 are zero and v?i ri) (0) = -</4 n) (0). TTien the 
function 

Vi(^) /ort>0, 



^:(-e,e)->M, ^(*) = 



is a C 1 function. 



Proof. Again, this is a local question, so we can work in a chart and assume without 
loss of generality that M = W 1 and ipi(0) = y?2(0) = 0. We have to show that the 
left and right derivatives of ip at t — are equal. According to Lemma 3.36, the 
right derivative of ip at is ^,</?^(0). To calculate the left derivative, we look at the 
function t (p 2 (—t). Its derivatives at zero of order up to and including n — 1 are 
zero, and its n-th derivative at zero is (—l) n (f^\o). So we can apply Lemma 3.36 
to it and calculate 

to j («(«)-«(<>)) = lim^(-v^) = -lim^ 2 (-^) = 

= -i(- 1 >"^" ) (o) = i(-i)M">(o) = ^S r,, (o). 

□ 

Theorem 3.38. If H is a regular bracket generating distribution on a manifold M , 
then for all e > 0, the subriemannian ball B(e,Xo) is a neighbourhood of Xq. 

Proof. (1) Let n be the dimension of M. We may work locally, so Lemma 3.28 
asserts that there exists an r e N such that H r = TM. Let . . . , n r ) be the 
growth vector at xq. Because H is regular, we can take orthonormal horizontal 
vectorfields Xi, . . . , X ni defined near x spanning H near xq. V = {X±, . . . , X ni } 
spans iJ, so Lemma 3.27 asserts that Vj spans Hj for j = 1, . . . , r. So there are 
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bracket expressions B 1 , . . . , B n such that the vectorfields Xi = B,i (Xi , . . . , X ni ) 
span TM, and such that for 1 < % < ni, Bi is simply a vectorfield, for rii < i < 
ri2, Bi consists of one bracket etc. For i — 1, . . . , n, let Wi denote the number of 
brackets in Bi increased by one. Wi is called the weighting of the vectorfield X^. 

(2) Let </?!,..., (p ni be the local flows of the vectorfields Xi, . . . , X ni . So tpi, . . . , ip ni 
are curves of local diffeormorphisms through the identity. Now, for % — 1, . . . , n, 
let ifii = Bi(ipi, . . . ,ip ni ). Note that this does not change the definition of 
ipi, . . . , ip ni because B±, . . . , B ni are bracket expressions of length zero. Ac- 
cording to Theorem 3.31, in any coordinate system centered at x we have 
(pi(t,x) = t Wt Xi(x) + o(t Wi ). If Wi is even, t Wi is never negative, so we can 
not move in the negative Xi direction. This motivates the approach of adapting 
the commutators of flows. 

(3) For i — 1, . . . , n, we build the formal bracket expression Cj by swapping the ar- 
guments of the outermost bracket in Bi. So, if Ei and Fi are bracket expressions 
such that Bi = [Ei, Fi], then d = [Fi, Ei}. Then we define 

,(p ni )(t,x) for Wi even, t > 0, 
Xi(t,x) = <( Cj(v?i, . . .,ip ni )(t,x) for Wi even, t > 0, 
>i(ip l7 . . . ,ip ni )(t,x) for Wi odd. 

(4) For i — 1, . . . , n, we write 

for even, t > 0, 
-> R, <Ti(*) := <( -t Wi for ^ even, t < 0, 




for Wi odd. 

Then <Tj is a non-smooth homemorphism. Let Sj be its inverse, i.e. 

for even, t > 0, 



R, Si(t) = <( - V=i for ^ even, t < 0, 
w </t for odd. 

Let ifji(t,x) := Xj(sj(i),x) and let 

X(h,...,t n ) := ° • • • °Xn(^n)(^o) 

ip(t!,. . . ,t n ) := ip n (t n ) o ■ ■ ■ o ^i(fi)(a;o)- 

According to Lemmas 3.36 and 3.37, the maps ipi, . . . ,ip n are C 1 in t for each 
fixed x. For each fixed t, they are C°° in x. Therefore ip is a C 1 -map defined on 
a neighbourhood of e R n . We calculate its partial derivatives at 0: 



d_ 
dti 



, Xi(x ) d 

ip = ; — because — 

wA ox 



^ = Id. 

(0,0) 



Xi(xq), . . . ,X n (xo) are linearly independent tangent vectors, so we may apply 
the inverse function theorem and invert ip in a neighbourhood of 0. The inverse 
of ip is C 1 , so it is continuous, and therefore ip is an open mapping. 
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(5) The map x is a composition of Xi(^i)> ■ ■ ■ ■> Xn(t n ), and each map Xi is a bracket of 
the flows </?!,..., </? ni which correspond to the horizontal vectorfields X 1? . . . , X ni . 
If we expand the map x m terms of the flows ip±, . . . , ip ni , let K denote the num- 
ber of flows that appear in this expansion. Now, we claim that x(Box(e/fT)) is 
contained in the subriemannian ball B(e,xo), where 

Box(e) = {y G R n : \ Vi \ < e, i = 1, . . . , n} . 

To prove the claim, we note that for 1 < i < ri\ and any point x in the domain of 
definition of (pi(U), the subriemannian distance d(x, (pi(ti)(x)) is less than e/K if 
\ti\ < e/K. The reason is that x and tp i (ti)(x) can be connected by the horizontal 
path 1 1 — y ipi(t, x),0 < t < ti. The length of this path is \ti\ because ipi is the flow 
of the vectorfield Xi, and X\, . . . , X ni were taken to be orthonormal. Now the 
claim follows because xo and x(A> ■ ■ ■ ,t n ) can be connected by a concatenation 
of K horizontal paths, each of length less than e/K. 

(6) Define the weighted box Box w (e) = {y G K : \yi\ < e Wi ,i — 1, . . . , n} as in the 
Ball-Box Theorem 3.23. Box w \e/K) is a neighbourhood of G R n and ip is 
an open mapping, therefore vp(Box w (e/K)) is a neighbourhood of x . Because 
B(e,x ) D x(Box(e/fr) = ip{Box U) {e/ K)), we have shown that B(e,x ) is a 
neighbourhood of rr as well. □ 

Lemma 3.39. If M is connected and every point has a horizontally pathwise con- 
nected neighbourhood, then M is horizontally pathwise connected. 

Proof. Any connected and locally pathwise connected topological space is pathwise 
connected. Here, we only allow horizontal paths, but the proof remains the same: 
Take an arbitrary point x G M. Then Acc(x ) is open. Let Acc(x ) c denote the 
set of all points not lying in Acc(xq). Acc(xq) c is open because it is the union 
U {Acc(x) : x Acc(x )}. Because M is connected, either Acc{xq) or Acc(xo) c is 
empty. Because xq G Acc(x ), Acc{xq) c must be empty, and Acc(x ) = M. □ 

Theorem 3.40 (Chow's Theorem). If H is a regular bracket generating distribution 
on a connected manifold, then any two points in the manifold can be connected by a 
horizontal path. 

Proof. Theorem 3.38 asserts that every point x in the manifold has a horizontally 
pathwise connected neighbourhood, namely B(e,x), so we can apply Lemma 3.39. 

□ 

3.3 Third proof of Chow's Theorem 

For this proof we need the assumption that H is locally finitely generated. The 
outline of the proof is the following: By Lemma 3.39 it is sufficient to prove that 
every point has a horizontally pathwise connected neighbourhood, so we can work 
locally and assume that Xi, . . . ,X k are defined everywhere and that M is an open 
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connected subset of R n . We will see in Theorem 3.57 that under suitable conditions, 
the differential equation 

k 

7(f) = 5^(0^(7(0), 7(0) = x 
i=i 

has a unique solution 7, and 7 depends smoothly on u and x. Then also the endpoint 
map, that is the map assigning to u the endpoint of 7 is smooth. Using some calculus 
in Banach spaces we will show in Theorem 3.65 that the image of the endpoint map 
is an initial submanifold of M. If H is bracket generating, this initial submanifold 
has the same dimension as M, and therefore it is an open subset of M. In fact, it 
is all of M, and this will conlude the proof of Chow's Theorem. 

Remark 3.41. The most important part in the proof is the fact that the endpoint map 
is a smooth map from a Banach space to M. We can ask if the set of horizontal curves 
can be endowed with the structure of a Banach manifold such that the endpoint 
map is a smooth map from this Banach manifold to M. The idea is to expand the 
derivative of 7 in terms of a local horizontal frame to get coordinates for 7. For 
regular distributions, this is done in Montgomery [8, Appendix D]. 

3.3.1 Calculus in Banach spaces 

A good treatment of calculus in Banach spaces, including the following theorems 
and definitions, can be found in the book of Abraham et al. [1]. 

Definition 3.42. If Xi, . . . ,X k and Y are Banach spaces, let L(Xi, . . . , X k ; Y) 
denote the Banach space of k- multilinear bounded operators of X±, . . . , X k to Y. If 
Xi = X, 1 < i < k, this space is denoted by L k (X, Y). 

Remark 3.43. There are (natural) norm-preserving isomorphisms 

L(X 1 , L(X 2 , . . . , X k )) = L(X 1 , . . . ,X k ; Y) = L(Xi, . . . , X k _i\ L(X k ; Y)) 

— L(X ix , . . . , X ik ; Y) 

where i±, . . . , i k is a permutation of (1, . . . , k). 

Definition 3.44. Let X and Y be Banach spaces, and U an open subset of X. A 
map / : U — > Y is called (Frechet-)differentiable at x e U if there exists a linear 
bounded map L e L(X, Y) such that 

^ \\f{x + h)-f{x)-Lh\\y _ Q 

h^o \\h\\ x 

If it exists, a map L with the above property is unique. L will be denoted f'{x), 
te(x) or jjt(x). Moreover, if / is differentiable at every point in U and if the map 

f:U^L(X,Y), x^f{x) 

is continuous, we say / is of class C 1 or continuously differentiable. Proceeding 
inductively, we define 

f (k) = U^L k (X,Y) 
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if it exists. Here we have identified L(X, L k ~ 1 (X, Y)) with L k (X,Y). If /(*) exists 
and is continuous, we say / is of class C k . f is said to be of class C°° or smooth if 
it is of class C k for all k > 1. 

Definition 3.45. Let U be a subset of a Banach space. Then a map F :[/—>■ U is 
called a contraction if there is L < 1 such that for all x, x G U 

\\F(x) - F(x)\\ <L\\x-x\\. 

Theorem 3.46 (Banach Fixed Point Theorem). Let U be a closed subset of a Ba- 
nach space X and F : U — >■ U a contraction. Then F has a unique fixed point. 

Proof. Choose x G U . We define a sequence x n inductively by x n+ i := F(x n ). First 
we will show by induction that for all n G N 

ll^ra+l — x n\\ ^ L n — Xq\\ . 

For n = the statement is trivial. For the inductive step we get 

\\Xn+2 ~ Xn+l\\ = \\F(x n+1 ) - F(x n )\\ < L \\x n+1 - X n \\ < ||xi - X \\ . 

We use this result to show that Cauchy sequence. Because < L < 1 we 

can find for all e > an N G N with 

-AT _ 



L JV < e- 

Fi - a;o|| 

Using the triangle inequality we get for all m,n G N with m > n > N 

< L m - X ||xi - x || + L m - 2 ||xi - x || + • • • + L n \\ Xl - x \\ 

m—n—l oo 

= \\ Xl -x \\L n Lk < ll^i -x \\L n J2 Lk 

k=0 k=0 

II II m 1 

= \\x 1 - x \\ L _ < e. 

Now x„ is a Cauchy sequence and converges to a point rc* G £/. re* is a fixed point 
of F because 

- F(x*)\\ = lim \\x n - F(x n )\\ = lim \\x n - x n+1 \\ = 0. 

n— >oo n— >oo 

To prove uniqueness of the fixed point, assume that x\ and x\ are fixed points of F . 
Then 

< \x\ - x\\ = \\F(x\) - F(x* 2 )\\ < L - x* 2 \\ . 
< L < 1 implies that \\x\ — x 2 \\ =0 and thus x\ = x* 2 . □ 

Theorem 3.47. Let F : U xVcXxY^-U with the following properties: 

(a) X and Y are Banach spaces 
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(b) U C X is closed 

(c) F is of class C r for r > 

(d) There is L < 1 such that for ally EV we have \\F(x,y) — F(x,y)\\ < L \\x — x\ 
Then for all y e V there exists a fixpoint x* (y) and the function 

x* : V ->• U, y i-> x* (y) 

is of class C r . 

Proof. (1) We will show that the function x* is continuous if F is continuous. 

\\x*(y + h)-x*(y)\\ = 

= \\F(x*(y + h),y + h)-F(x*(y),y + h) + F(x*(y),y + h) - F(x*(y),y)\\ 
<L\\x*(y + h)-x*(y)\\ + \\F(x*(y), y + h) - F(x*(y), y)\\ 



It follows: 



\ x *( y + h)-x*(y)\\< 



1 



\F{x*{y),y + h)-F{x*{y),y)\\ (*) 



Since F is continuous and (1 — L) > the continuity of x* is clear. 
(2) We will show that x* is C 1 if F is C 1 . If x* is differentiable, it must satisfy 



dy 



9 F dx* 9 F 

(y)--^( x *(yly)^ y (y) + -Q y ( x *(y^y)- 



This follows from differentiation of the equality x*(y) = F(x*(y),y). To prove 
that x* is differentiable, we will apply the Banach Fixed Point Theorem to the 
operator 

dF dF 
T(.,y) : L(Y,X) L(Y,X), A ^ —( x *( y ), y )A+—(x*(y),y). 



Because F is a contraction, we have 

dF 



1 — 1 

dx I 



T(A,y)-T(A,y) 



< L, and therefore 



dF 



()x (x*(y),y)A-—(x*(y),y)A 







< L 


A- A 







so T is a contraction. By what we have shown in step (1) there is a unique fixed 
point A*(y) of T and A* is continuous in y. We want to show that A* is the 
derivative of x*. With u := x*(y+h) — x*(y) this is equivalent to ||u — A*(y)h\\ — 
o(\\h\\). Because 



\u-A-(y)h\\ < 



-1 



I-^(x*(y),y))(u-A*(y)h) 
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it is sufficient to show that 

dF 

dx 



x*(y),y))(u-A*(y)h) 



o(\\h\\). 



From the fixed point equation A* (y) 

dF 



T(A*(y),y) we have 



l-^-( X *(y),y))A*(y) 



dF 
dy 



x*(y),y), 



and therefore 
dF 



dx 



x*(y),y) (u-A*(y)h)=u 



dF | 

dx 
dF 



x*(y),y)u 



= F(x*(y + h),y + h)- F(x* (y) ,y)) - — (x* (y) , y)u 



dF 
dy 

dF 

dy 



x*(y),y)h 
x*(y),y)h. 



The above expression is o(||m|| + \\h\\). Since F e C 1 , it satisfies a Lipschitz con- 
dition in y, and one can read off equation (*) that ||w(/i)|| = 0(||/i||). Therefore 
the above expression is also o(||/i||), and this concludes the proof. 

(3) We will show by induction that x* e C r if F e C r . The case r = has been 
treated in step (1). For the inductive step, assume that F e C r+l . Then the 
map T defined in step (2) is C r , and the derivative of x* is the fixed point of 
T. By the inductive assumption, the derivative of x* is C r , and therefore x* is 
C r+1 . □ 

Theorem 3.48 (Inverse Function Theorem). Let X and Y be Banach spaces, and 
let U C X be an open set in X . Let f : U — > Y be of class C r , r > 1, xq G U and 
suppose that the Frechet derivative f'(xo) is a Banach space isomorphism. Then 
f is a C '-diffeomorphism of some neighbourhood of Xq onto some neighbourhood of 
f(x ). 

Proof. First we note that it is enough to prove the theorem under the simplifying 
assumptions that 

X = Y, u = f(x ) = and f(x ) = ld x ■ 

Indeed, we can replace / by the map 

x ^ f'ixo)- 1 o (f(x + x ) - f(x )) . 

With these assumptions we have 

f(x) = y F(x, y) := f(x) - x - y = x. 

The differential ^ is independent of y and vanishes at (0, 0), so there is e > such 
that 



<9F ( 

dx 



x,y) 



< 



whenever 



\x\\ < e. 



Then 



F : B(e,0) x B(e/2,0) ->■ B(e,0) 

is well-defined and satisfies the hypothesis of the Banach Fixed Point Theorem 3.47 
with L = 1/2. The function x*(y) is of class C r and is the inverse of /. □ 
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We will need a generalization of the Constant Rank Theorem 2.14 to Banach 
spaces. 

Definition 3.49. A closed subspace Xi of a Banach space X is said to be split (or 
complemented) if there is a closed subspace X 2 C X such that X = X± © X 2 . 

Remark 3.50. If X and Y are Banach spaces, Y is finite-dimensional, and / : X — > Y, 
then the kernel and the image of / are split. This situation will occur in the analysis 
of the endpoint map in Section 3.3.3. 

Theorem 3.51 (Constant Rank Theorem in Banach spaces). Let X and Y be 
Banach spaces, and let U C X be an open set in X . Let f : U —> Y be of class C r , 
r > 1; u o £ U and suppose that the Frechet derivative f'(uo) has closed split image 
Y\ with closed complement Y 2 and split kernel X 2 with closed complement X\. In 
addition, assume that for all u in a neighbourhood of uq G U , f'(u)(X) is a closed 
subspace ofY and f'(u)\X 1 : X 1 — > f'(u)(X) is a Banach space isomorphism. Then 
there exist open sets 

U 1 CY 1 ®X 2 , U 2 CX, V 1 CY, V 2 CY 

and there are C r -diffeomorphisms 

<f : Vi — > V 2 and if) : U\ — >■ U 2 

such that (ip o / o ip) (x, e) = (x, 0) . 

Proof. 

(1) Write f = f 1 xf 2 with f t : U ->■ Y t for i = 1, 2. Then 

: JC, -» K 

is a Banach space isomorphism. We define 

g:U ->■ Y 1 © X 2 , g(x 1 ,x 2 ) = (fi(x 1 ,x 2 ),x 2 ). 
Then for (h Xl , h X2 ) G X 1 © X 2 we have 

and therefore ^'(wo) : X = Xi © X 2 — >■ Yi © X 2 is a Banach space isomorphism. 
So by the Inverse Function Theorem 3.48 g has a local inverse ^ : Ui — > U 2 C U 
of class C r , = g\U 2 . Then (/i o^)(y 1 ,x 2 ) = yi for (yi,x 2 ) G C/i. 

(2) We claim that 

(/o^yi,^)!^ x {0} : x {0} ^ (/o^)'(y 1 , a ; 2 )(F 1 © X 2 ) 
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is a Banach space isomorphism for (2/1,2:2) G U 1 . To prove the claim, we first 
notice by looking at (*) that 

g'(u)\X 1 x{0}:X 1 x{0}^Y 1 x{0} 

is a Banach space isomorphism. Therefore its inverse maps Y\ x {0} bijectively 
to Xi x {0}. Furthermore, the hypothesis of the theorem says that f'(ip(yi, x 2 )) 
is a Banach space isomorphism when restricted to X\ x {0}, and the claim 
follows by the chain rule 

(/°^)'(yi,z 2 ) = f'(il>(yi,x 2 ))oil>'(y 1 ,x2). 

(3) Let (h Yl , hx 2 ) G Y 1 © X 2 . Then because of (2) there is h Yl such that 

(f°fpy(yi,x 2 )(h Yl ,hx 2 ) = (foi/jy(y u x 2 )(h Yl ,0). (**) 
We see from (1) that 

(fi°^Y(yi,x 2 )(h Yl ,hx 2 ) = h Yl , 
so ft^ equals /i^. But then (**) implies that 

-^-(yi.^) = o, 

so f oip does not depend on the variable x 2 , and we can write 
/rPy.^cy^y, f(yi) = fo 1 p(y 1 ,x 2 ) 
where P Yl :Yi®X 2 —^Yi is the projection. 

(4) Let y = P Yl (^^(uq)). The derivative of / at y is injective and has closed split 
image Y 1 with closed complement Y 2 . Define 

g : P Yl (V) xY 2 cY l ®Y 2 ^Y l ® Y 2 , g( yi ,y 2 ) = f( Vl ) + (0,y 2 ). 

Then the derivative of g at (yo, 0) is a Banach space isomorphism, and therefore 
by the Inverse Function Theorem 3.48 there is a C r -diffeomorphism tp : V\ — > V 2 
with Vi, V 2 C Y and y? -1 = g\V 2 . Because f(yi) = g{yi, 0), we have 

{<p o f)( yi ) = <p(g(y u 0)) = (y u 0) G Fx © F 2 

for all yi such that (yi, 0) G V 2 . Therefore (ip o f o ip)(yi, x 2 ) — (yi, 0). □ 

3.3.2 Superposition operators 

Definition 3.52. Let / : f2 x Xi — >■ X 2 be a function. Then the superposition 
operator associated to / is a mapping F that assigns to each function x : Q — > X\ 
a function Fx : — > X 2 defined as 

Fx(u) = f(u,x(cj)). 

F is also called composition operator or Nemytskij operator. 
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The book of Appell and Zabrejko [2] gives a very good overview of superposition 
operators acting on different spaces of functions. Our interest lies in the situation 
where F acts on continuous functions. The results presented here are generalizations 
of Appell and Zabrejko [2, chap. 6], where only the case Xi = X 2 = K is treated. 

Definition 3.53. If Q is a compact metric space and X is a Banach space, then the 
set C(Q;X) of continuous functions from Q to X is a Banach space when equipped 
with the norm 

\\ X \\c(Q;X) =SUp|Ks)|| x . 

Lemma 3.54. Let X\ and X 2 be Banach spaces, Q a compact metric space, U an 
open subset of X 1} and f : Q x U — >■ X 2 . If f is continuous, then its associated 
superposition operator F : C(Q; U) — > C(Q; X 2 ) is continuous. 

Proof. If / is continuous, then clearly F maps continuous functions to continuous 
functions. It remains to show that F is continuous at every x G C(Q; U). Because 
Q is compact and x is continuous, the graph of a; is a compact subspace of Q x U 
equipped with the product metric 

d^u^xi), (uj 2 ,x 2 )) = max (d(u>i,u) 2 ), \\xi - x 2 \\ Xl ) ■ 

f is continuous, so around every point p in the graph of x we find a metric ball 
E>s( p )(p) with radius 5{p) such that whenever q lies in this ball, we have 

||/(p)-/(?)IU<6/2. 

Finitely many balls B$( p y 2 (p) cover the graph of x, say the balls Bs( Pi )/ 2 (pi) for 
i = 1, ... Let S be the smallest radius of these balls, i.e. 

5 = min {5(pi)/2 : i — 1, . . . , n} . 

Now we claim that if y is a function in C(Q; U), we have 

\\ x ~ y\\c(n; Xl ) < 6 W Fx ~ F y\\c(n ; x 2 ) < e - 

To prove this claim, take a point p y = (u,y(uj)) in the graph of y. Because p x = 
(lu, x(uj)) lies in the graph of x and the graph of x is covered by the balls Bs( Pi )/ 2 (pi), 
there is a point pi in the graph of x such that d(p x ,p io ) < 5(pi )/2. Now, 

d(p y ,Pi ) < d(p y ,p x ) + d(p x ,p io ) <5 + S(p io )/2 < 5(p io ). 

So both p y and p x lie in the same ball Bs( Pi )(Pi )- Therefore 

WfiPx) - f(Py)\\ X2 < ||/(Px) - f(Pi )\\x 2 + MM ~ f(Pv)\\x 2 < e /2 + e/2 = e. 

Since this is true for all points p x = (u, x(u)) and p y = (u, y(u))), u 6 fi, we get the 
desired inequality \\Fx — Fy\\c(n-x 2 ) < e - ^ 

Lemma 3.55. Let X\ and X 2 be Banach spaces, Q a compact metric space, U an 
open subset of X\, and f : QxU — > X 2 . If f is continuous and has a continuous par- 
tial derivative then its associated superposition operator F : C(Q; U) — > C(Q; X 2 ) 
is of class C 1 . 
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Proof. We claim that the Frechet derivative of F at x E C(Q; U) is given by 



df 

F'(x)(h)(uj) = -^-(u,x(u))h(u) or equivalently F'(x) — M{G(x)), 

where M and G are defined as follows: 

M : C(Q;L(X i; X 2 )) -> L(C(Q;X 1 );C(Q;X 2 )), M(g)(h)(u) = g(u)(h(uj)). 

It is easy to see that M is a well-defined bounded linear operator. Therefore M is 
smooth. 

G : C(Q;U) -^C(Q;L(X 1 ;X 2 )) 
is the superposition operator associated to the continuous function 

df 



dx 



■MxU ->• L(X 1 -X 2 ). 



By Lemma 3.54 we know that G is well-defined and continuous. To prove that 
M(G(x)) really is the Frechet derivative of F at x, we have to show that 

(1) For all x G C(f2;£7), M(G(x)) is a bounded linear operator from C (O; X x ) to 
C(Q;X 2 ). 



(2) \\F(x + h)- F(x) - M(G(x))(/ i )|| c(n;X2) = o {\\h\\ c{n;Xi) ). 

(1) is clear from the definition of M. (2) follows from the Taylor formula for /: 

\\ F{x + h) _ F{x) _ M (G(x))(h)\\ c{n;X2) = 



sup 



= sup 



sup 



df 

f(uj,x(u) + h{u)) - f(u,x(u)) - —(u,x(uj))h(uj) 

1 df df 
— — (u, x(u) + th{u))h{uj) dt — — (u, x{u))h{uj) 

1 fdf df \ 

— — (u>, x(u>) + th{u)) — — — (w, (it 
ox ox I 



x 2 



x 2 



< sup sup 

wen o<t<i 



df df 
— (u, x(u) + th{u)) - 7^;(w, 



x 2 
\\h{u)\\ 



L(Xv,X 2 ) 



Xi 



< 



c(n- Xl ) SU P \\G(x + th) - G(x)\\ c{n . L{Xi . X2)) 



0<t<l 



= o \\h 



ic(n ; xi) / ' 



because G is continuous. 



It remains to show that x h- > F'(x) is continuous. This is clear since G is continuous 
and M is smooth. □ 

Theorem 3.56. Let X\ and X 2 be Banach spaces, Q a compact metric space, U a 
compact subspace of X 1 and f : Qx U — > X 2 . If f is continuous and has continuous 
partial derivatives for < k < n, then its associated superposition operator 
F : C(Q; U) ->■ C(n*X 2 ) is of class C n . 
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Proof. The case n = has been proven in Lemma 3.54, and the case n — 1 in Lemma 
3.55. We will prove the theorem by induction. For the inductive step, let / have 
continuous partial derivatives for 1 < k < n + 1. By Lemma 3.55, F is of classC 1 
and its derivative is given by F' = M oG. G is the superposition operator associated 
to the function which is continuous and has n continuous partial derivatives. By 
the inductive assumption, G is of class C n . Because M is C°°, F' = M o G is of 
class C n . So F is of class C n+1 . □ 

3.3.3 The endpoint map 

In all of section 3.3.3, we will use the following assumptions: 

(a) M is an open connected subset of R n . 

(b) H is spanned by a finite number of vectorfields X±, . . . , e X(M). 

(c) There is AT G R such that ||Xi(a;)|| Rf , < K for all x G M. 

(d) There is L G R such that \\Xi(x) - Xi(y)\\ Rn < L for all x,y G M. 
Theorem 3.57. Under the assumptions (a) to (d) there is an open set U , 

U C -^([O, l];R fe ) x M, {0}xMcU 
such that for each (u, x) G U the equation 

k 

7 (t) = ^^(t)X l ( 7 (t)), 7 (0) =x (*) 
i=i 

has a unique solution 7 G C([0, 1]; M) which depends smoothly on u and x. 
Proof. 

(1) For the moment, let us work on an arbitrary interval [a, b] contained in [0, 1] C R. 
We need some conventions about the norms used. Define 

n 

\<E ram / 1% 



\ U \\L 1 ([a,b];R k ) 



E 

i=i 

^ / k(s)|ds 
i=i Ja 



^-f «6 [0,6] 
J=l 



l U llc([o,6];K*) — ^ maX l M »l s ) 



(2) Let 7 denote the integral operator 

I : 



^([0,6];^) -)• C([a,6];R fc ) 

^ (t>->/o«(s)ds). 
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Then J is a linear operator with operator norm equal to one, so I is smooth. 
Define 



F : 



C([a,b];M)xL 1 ([a,b];R k )xM ->■ C([a,b];R n ) 



Note that F is defined on an open set because C([a, b}; M) is open in C([a, b}; IR n ) 
whenever M C M. n is open. By Theorem 3.56, F is smooth. 

(3) We can rewrite the differential equation (*) as an equivalent integral equation 

7 = F(l,u,x). 

(4) Let 70 = F(jo, uq, xq) be a solution on the interval [a, b] with (70,^0,^0) m the 
domain of definition of F. We claim that under the assumption 



ll M 0|| L i (M;M fc) < 3L 

we can find a neighbourhood U of (u , x ) in L 1 ([a, b]; M. k ) x M such that for all 
(u, x) G U the equation 7 = F(j, u, x) has a unique solution 7 G C([a,b]; M). To 
prove the claim, we choose e > small enough such that B(e, 70) C C([a,b]; M). 
Then we define 



G 



5(e, 7o ) x (S(^,«o)nS(^,0)) x £(§, x ) S(e, 7o ) 
(7,m,x) 1— >■ F(x,7,m). 



First, we show that the image of G really is a subset of B(e, 70). 
11(7(7, w, x) -7o|lc([o,6]^») = ll G (7,M,a;) - G(7o,M ,a;o)|| c(M;M n ) 



Mi Xi O 7 - M ,j Xi o 7o 



1=1 



< \\x - X \\ Rn + ^ \\ U i X i 7 - «0,i -^i 7|l ) 1.. .1 ■■ : 



Li([o,6];R") 



i=l 
fc 



+ W u o,i X z o 1 - u 0yi Xi o 7o|| L i (M;D 



i=l 



< ||X X || R „ + ||U M o|lLi([ a ,b];R«=) ^ + ll M o|lLl([a,b];R*0 ^ 117 7o|lc([a,6];I 



e e e 
£ 3 + 3 + 3 =f - 



Furthermore, G satisfies the contraction property 



||G(7,M,a;) -G(7,M,x)|| c(M;Rn) < ^ ||^(Xi o 7 - o 7) || Ll(M;Rn) 



i=i 



- ll u llx,i([a,6];R fc ) ^ HO' 7llc([a,b];R«) - o 117 7llc([o,6];R") 



1 

3 
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for all (7, u, x) and (7, u, x) in the domain of G. Therefore we can apply the 
Banach Fixed Point Theorem 3.47 to G, and this proves our claim with 

U= ( 5( 3^' Mo)njB( 3T' 0) ) xB ^ x ^- 

(5) Now we will work on the interval [0, 1] and drop the assumption that 

1 

ll M 0|| L l( [0 ,l];Rfc) < g£- 

Let 70 = -F(7o, ^0) be a solution on the interval [0,1] with (70,^0,^0) in 
the domain of definition of F . We claim that we can find a neighbourhood U 
of (uq,xq) in L 1 ([0, 1]; x M such that for all (u,x) G U the equation 7 = 
F(7, u, x) has a unique solution 7 G C([0, 1]; M). To prove the claim, we choose 
e > small enough such that B(e, 70) C C([a,b}; M). Let = t < • • ■ < t N — 1 
such that on each interval [t/,tj + i] 

II 1 II 1 

holds. Then we can apply the result obtained in (4) to each of the intervals 
[tj,tj + i] with 

a = tj,b = tj+i,7o = 7o|[t 3 ,t 3+1 ],Mo = uo\[tj,t j+1 ],xo = j{tj). 

We want to concatenate the solutions on these intervals to get a solution on 
[0, 1], so we have to make sure that the endpoint of a solution curve on [tj, tj+i] 
lies in ^(tj +1 )). This is the fact for 




The solution on each interval [tj,tj + \] depends smoothly on the control and 
initial value. Because the operator assigning to each curve its endpoint is a 
linear, continuous operator C([tj, tj +1 \; M) — > M, it is smooth, and therefore 
the concatenation of the solutions curves depends smoothly on x and u . □ 

The remaining part of this section is taken from Bellaiche [3] . 
Definition 3.58. In the situation of Theorem 3.57, we define 

Evol^x) = 7(0 and End^w) = 7(1) 

for t G [0, 1], (u,x) G U and a curve 7 G C([0, 1]; M) with initial value x controlled 
by u. We will call End^ the endpoint map and Evol" the evolution of the controlled 
vectorfield Yli=i u iXi- 
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Remark 3.59. End,, is smooth for each x e M. This is true because the evaluation 
operator 

C([0,1];R")^R", 7 ->7(l) 

is linear and bounded, and therefore smooth, and because furthermore 7 depends 
smoothly on its control u and initial value x by Theorem 3.57. 

Definition 3.60. For u,v e ^([0, l];R fe ) and < s < 1, let u * s v e ^([0,1]; 
be the control function defined by 



(u * s v)(t) 



if < t < s 

T-vC-^) if S < t < 1. 
1— s VI— s' — — 



In many cases we will use s = \ , so we define u*v — u*iv for the sake of a simpler 
notation. Furthermore, we define 

u(t) = -u(l-t) for0<t<l. 

u * s v will be called the concatenation of u and v, and u the inverse control of u. 

Remark 3.61. It is easy to check the following properties: 

(1) * s ^||ii([ 0) i] ; ]Rfc) = ll' lt llz,i([0,l];]R fe ) + ll' U llz, 1 ([0,l];M fc )- 

(2) Evo\\* sV = Evoll o Evol^ 

(3) Evol" is a local diffeomorphism with inverse Evol". 

Definition 3.62. Let r denote the maximal rank of End^. A control u e U is called 
normal if the rank of End x at u is equal to r. 

Lemma 3.63. If u is a normal control, so is u* s v for any < s < 1. 
Proof. The tangent space in L 1 ([0, 1]; M fc ) at u * s v is given by 



d 
dt 



{u* s v)+th, he L\[0,l];R k ). 
The tangent vectors of the form 

(u + th)* s v, he L\[0,l];R k ) 



d 
dt 



constitute a vector subspace of the tangent space at u* s v. We have 



TEnd„ 



d 
dt 



{u + th) * s v 



d 
dt 
d. 
dt 



End x ((u + th) * s v) = 
Evol^ o End,, (u + th) = 



-- T Evol^ oT End,, ( 
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(- 

\dt 



u + th) . 



The vectors 



d 
dt 



o 



u + th, h 



G L 1 ([0,l];M fc 



constitute the tangent space T u L 1 ([0, 1]; M. k )). Let r denote the maximal rank of 
End x . Then because we assumed that u is a normal control, the dimension of 
T End x (TuL 1 ( [0, 1]; R k )) is r. Evol^ is a local diffeomorphism, and this concludes 



Lemma 3.64. For every point y in the image of End x there is a normal control 
steering x to y. 

Proof. Suppose y is attained from x by means of a control u. Choose any normal 
control v. Then the control v * v * u steers x to y and is normal by Lemma 3.63. □ 

Theorem 3.65. For x G M, the image of End x is an initial submanifold, and its 
inclusion into M is an initial mapping. 



(1) Let N denote the set of controls in the domain of End x with maximal rank r. 
By Lemma 3.64, we can restrict End x to N without loosing any points in the 
image of End x , and we will do this without change of notation. 

(2) Now, End-j is a smooth map with constant rank r. The Constant Rank Theorem 
3.51 gives us for every u G iV a neighbourhood U of u such that End x (£/) 
is a small r-dimensional submanifold of M. To make End x (iV) an immersed 
submanifold, we give it the coarsest topology such that End x is an open mapping, 
and we use the charts of the small submanifolds End x (£7) as charts. We need 
to check that each set End^C/) is open in End x (iV), which is trivial, and that 
End x (C/i) l~l End x (C/ 2 ) is open. Let 



We look at the control U3 := u\ * 1 * ix 2 , for which the Constant Rank The- 
orem gives us a neighbourhood C/ 3 of w 3 such that ~Ex\A x {U$) is a submani- 
fold. By making U\ and U 2 small enough, both U\ * 1 * u 2 and u\ * 1 * U 2 
can be made a subset of C/ 3 . So both End a; (L r i) = End x (?7i * u 2 1 * u 2 ) and 
End x (U 2 ) = Enda;(-ui * u 2 1 * C/2) are r-dimensional submanifolds of M lying 
in the r-dimensional manifold End x (C/ 3 ). Hence their intersection is open in 
End x (Us), and therefore also open in End^iV). From these considerations we 
know that (End x (iV),i) is an immersed submanifold, where % : End^iV) — > M 
is the inclusion. 

(3) The rest of the proof is basically the same as the proof of Theorem 3.14. We will 
construct initial submanifold charts for End x (iV). For y = End x (w) G End x (N), 
by the Constant Rank Theorem 3.51, there is a chart (U, 4>) centered at u G iV 
and a chart (V, ip) centered at y G M such that 



the proof. 



□ 



Proof. 



y = End x (ui) = End x (w 2 ) G End x (?7i) n End x (C/ 2 ). 



(^oEnd :c o0- 1 )(t 1 ,...,r,e) = (t 1 ,...,r,O,... 



0). 
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Then 



/ : (t 1 , . . . , t n ) ^ End v ,-i (0 ,... ! o,r+i,...^) o^t 1 , . . . , f, 0, . . . , 0) 

is a diffeomorphism near zero. Let (W,x) be the chart given by suitably 
restricted. For all t 1 , . . . , t n such that both expressions make sense, we have 

f{t\ ...,t n )e End x (N) ^ /(0, . . . , 0, f+\ ...,t n )e End x (N). 

From this we see that End x (N) D W is the disjoint union of connected sets of 
the form 

{w e W : x r+1 { w )> ■■■■> X n { w ) — constant}. 

Since End x (iV) is a connected immersed submanifold of M, it is second count- 
able, and only a countable set of constants can appear in the description of 
End x (iV) H given above. Therefore x is a distinguished chart as in Theorem 
3.14. Furthermore, we see that 

X (C x (End x (N) n WO) = xWHM r x {0}, 

so % is an initial submanifold chart. Because we can find such a chart around 
every point in End x (N), End^iV) is an initial submanifold. 

(4) According to Lemma 2.22, there is a unique smooth manifold structure that 
makes End^iV) an immersed submanifold with the property that the inclusion 
is an initial mapping. It can be seen from the proof of this Lemma that charts 
for this structure are given by restricting the initial submanifold charts (W, x) 
to C x (End x (N) n W). Restricting x to C x (End x (N) n W) yields the inverse of 
the map 

(fV.^O^End, o0- 1 (i 1 ,...,r,O,...,O), 
and this map already is a chart of End^iV). □ 

Theorem 3.66. For x e M, the accessible set Acc(x) equals the image of End x . 

Proof. Let L denote the image of End x . Of course, L is horizontally pathwise 
connected, so we have L C Acc{x). We will show that Acc(x) C L in a few steps. 
For the sake of a simpler notation let ||-|| denote || • II ^^([o i]-M fe ) • 

(1) We claim that for any function u G -^ 1 ([0, 1]; R k ) we have 

lim ||u * s — u\\ = 0, 

where denotes the zero control. This is clear for continuous functions u by 
dominated convergence, and it still holds for arbitrary u by a density argument: 
Indeed, for e > we can find a continuous function v with ||u — v\\ < e/3. Then 

||u * s — u|| < \\u * s — v * s 1 1 + \\v * s — v\\ + \\v — u\\ 

n n n n n n n - e e e 
= \\u — v\\ + \\v * S \J — v\\ + \\v — u\\ < — I 1 — = e 

o o o 

for s close enough to one, and this proves the claim. 
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(2) For j — 1, . . . , k, let ej be the unit vectors in IR fc , and let u be a normal control. 
From 

||u * s ((1 — s) ej) — u\\ < \\u * s ((1 — s) e_j) — u * s 0|| + ||u * s — u\\ 

= ||(1 — s) ej\\ + \\u * s — m|| 

we see that u* s ((1 — s) e,) is still a normal control for s close to one because the 
set of normal controls is open. But End x (-u * s ((1 — s) e,)) = FZ 1 i s (End- r (w)). 
Therefore we can conclude that L is stable under the flows of Xj. The flows 
of Xj are smooth mappings taking values in L. Because % : L — > M is an 
initial mapping, the flows are also smooth as mappings into L, and therefore 
the vectorfields Xj are tangent to L. We conclude that the tangent space of L 
contains H. 

(3) We claim that a horizontal path never leaves L. The proof is exactly the same 
as the one given in Theorem 3.20. □ 

With Theorem 3.66 we are able to prove Chow's Theorem. However, we will first 
prove the slightly stronger 

Theorem 3.67. If H is a bracket generating distribution on M, then the endpoint 
map is open and its maximal rank equals the dimension of M . 

Proof. We have to show that for an open subset U of the domain of definition of End^ 
and for y = End^w) G End x (U), there is a neighbourhood of y that is contained in 
End x (l7). 

(1) Because \\u * s v — u * s 0|| = ||i>|| for an arbitrary control v and by looking at 
step (1) in the proof of Theorem 3.66 we see that there is a neighbourhood V 
of and some s close to one such that u * s V C U. Then also 

End y (V) = End x (u * s V) C End x (l7). 

It is sufficient to show that End J/ (l / ) is a neighbourhood of y. 

(2) For e > let r(e) denote the maximal rank of End^ on the set B(e, 0) in 
L 1 ([0, l];M fc ). r is monotone and non-zero, so there is eo > such that r(e) = 
r(e ) for all e < e . We can choose e even smaller so that additionally B(e , 0) C 
V. Let V denote the set of controls in B(e ,0) where End y has maximal rank 
r(eo). Then V is open. B(e/2,0) contains a control v where End y has rank 
r(eo). Then v * v G V, and y = End y (v * v) G End y {V'). It is sufficient to show 
that End y (y') is neighbourhood of y. 

(3) Endj, has constant rank on V', so by the Constant Rank Theorem 3.51 there 
is an even smaller neighbourhood V" of v * v such that End y (V") is a small 
sub manifold L. It is sufficient to show that L is a neighbourhood of y. 

(4) By the same arguments as in step (2) in the proof of Theorem 3.66 the vector- 
fields Xj are tangent to L, and so are all the brackets of Xj. Because of the 
bracket generating condition, the dimension of L equals the dimension of M, 
and therefore L is an open neighbourhood of y. □ 
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3.3.4 Proof of Chow's Theorem 



Theorem 3.68 (Chow's Theorem). Let H be a locally finitely generated, bracket 
generating distribution on a manifold M. Then any two points in M can be joined 
by a horizontal path. 

Proof. First, we will work locally, and therefore we can assume that M is a con- 
nected, open subset of W 1 , and that H is spanned by Xi,...,X k e X(M). By 
making M small enough we can assure that assumptions (a) to (d) of Section 3.3.3 
hold. So we can apply Theorem 3.66 which asserts that for any x G M, Acc(x) 
equals the image of End x , which is an initial submanifold. Horizontal vectorfields 
are tangent to it, and so are all their brackets. Because of the bracket generating 
condition its dimension equals the dimension of M, and therefore it is open. 

Now we return to the global case. By the previous considerations, every point 
in M has a horizontally pathwise connected neighbourhood. Then by Lemma 3.39 
M is horizontally pathwise connected. □ 



4 The Poincare Lemma 

The Poincare Lemma in its general form states that locally, all closed differential 
forms are exact. For one- forms, this results to the following lemma: 

Lemma 4.1. Let U be a star domain in R n . Let hi : U — >■ K be smooth functions 
such that f— — |~ f or M 1 < hj < n. Then there exists a smooth function 

f : U ->■ R such that hi = 

In the above situation / can be recovered from the functions hi by integration. 
Indeed, if 7 : [0, 1] — > M. n is an absolutely continuous curve, 7(0) = x , 7(1) = x and 

Ui = Iw ' tnen 



/(*) = f(x ) + jf 1 ^^(t) dt = jf 1 j^u>m(l(t)) dt. 

It is possible to adapt this idea to subriemannian geometry. If H is a bracket 
generating distribution, then by Chow's Theorem 3.3 the points x and xq can be 
connected by a horizontal path 7. Using the same procedure as above, / can be 
reconstructed from its horizontal derivatives alone. Indeed, if 7 is controlled by 
horizontal vectorfields X\, . . . , X k with controls «j, . . . , u k £ L l {[0, 1];R), then 

f(x) = /(*„) + jf 1 ^pi(i) dt = jf 1 f^Uiit) W)( 7 (0) dt. (*) 

In the following we will study how continuity and differentiability is passed on from 
the horizontal derivatives of / to /. 
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Lemma 4.2. Let H be a regular bracket generating distribution endowed with a 
smooth subriemannian metric. Let f : M — >■ K. be a function such that 



Xf:M^R, (Xf)(z):=± 



foFlf(x 



exists and is continuous for all X e %i c,h(M). Then for every point x e M t/iere 
zs a neighbourhood U of x and a constant C such that the following statement holds 
for all x G U: 

\f(x)-f(x )\<Cd(x,x ). 
Here d(x,xo) is the subriemannian distance of x and xo as in Definition 1.17. 

Proof. This is a local statement, so we can assume that there are everywhere defined 
horizontal vectorfields X±, . . . , Xk spanning H. Because H is regular, we can assume 
additionally that Xi, . . . Xk are orthonormal. Then for all horizontal curves 7 that 
connect x to x by means of /^-controls Ui we have 

1 

\f(x)-f(x )\<[ £>(f)||W)(7(*))l ^ 
Jo i=i 

1/2 / fc x 1/2 



dt = c^ 1 || 7 (t)|| d* = C-Z( T ), 



where C bounds (£*Li W) 2 ) 1/2 - Therefore 

\f(x)-f(x )\ <Cinf Z( 7 ) = Cd(a;, J /). □ 

7 

It is important to note that — < Cd(x,y) does not imply that / is 

Lipschitz continuous with respect to the Euclidean structure in a chart. This will 
be illustrated by the following example. 

Example 4.3. We look at the function 



f(x,y,z) = (\xf 2 + \yf 2 + |2;| 3/2 j 



1/2 



on the Heisenberg geometry, i.e. M. 3 endowed with the distribution H spanned by 
the vectorfields 

X= — - V -— and X=— + - — 
dx 2 dz dy 2 dz ' 
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From [X!,X 2 ] = ^ we see that H is bracket generating. We calculate 



df 

dx 

df 
dy 

df z 
dz 




5/2 + \yf 2 + \zf 2 ) 1/2 |a;| 3/2 sgn(a:) for (x, y, z) ^ (0, 0, 0) 



for (x,y,z) = (0,0,0) 



5/2 



1 5/2 



+ z 



3/2 ) 1 b| 3/2 sgn(y) for (x, y, z) ^ (0, 0, 0) 



for (x,y,z) = (0,0,0) 



5/2 + |y| 5/2 + \zf 2 ) " 1/2 |z| 1/2 sgn(^) for (x, y, z) ± (0, 0, 0) 



for (x,y,z) = (0,0,0). 



Because of the singularity of |£ at zero, / is not Lipschitz continuous with respect 
to the Euclidean structure. However, the functions 



dx ' dy ' 



2 92 



/ and 



rr (9 \ 



./ 



are continuous, as the following estimates show. We will denote the p-norm on 
by ||-|| , and we will use the fact that all norms on IR 3 are equivalent. 



v 
df 



dx 



df 



< 



< 



x 



5/2 



+ \y\ + \Z 



5/2^ 



-1/2 



\X 



3/2 



lK\\K x ,y, z )\\5/2 



)- 5 / 4 (||(a;,y,2)|| 1 ) 3 / 2 <const(||( a ;,y,2)|| 1 ) 1 / 4 . 



dy 



y_df_ 
2 dz 



x df 
2dz 



< 



< 



< 



x 



5/2 



1 5/2 



1 5/2 



-1/2 



1 3/2 



+ 1 2/ ] + \z\ 

^y^)ll5/2)- 5/4 (ll(^y^)lli) 3/2 < constdK^y,^!!,) 1 ^. 



\x 



5/2 



+ |y| 5/2 + k 



5/2 



-1/2 



1 1/2 



< ^ll(x,y,^)||a||(x,y,^)|| 5/2 )- 5 / 4 (||( a ;,y,2)|| 1 ) 1 / 2 <const(||(x, 2/ ,z)|| i ; 



1/4 



< 



< 



X 



5/2 



+ M 5/2 + k 



5/2 



-1/2 



1 1/2 



x,y,z)||J||(x,y,2)|| 5/2 )- 5 / 4 (||(x,y,^)|| 1 ) 1 / 2 <const(||(x,y,2)|| 1 ) 1 / 4 . 



Therefore also Xif and X 2 / are continuous. Because any horizontal vectorfield 
X can be written as X = f±Xi + fiX-i with smooth functions f\ and / 2 , X/ is 
continuous for all horizontal vectorfields X. So the conditions of Lemma 4.2 are 
satisfied, and the lemma implies that / is Lipschitz continuous with respect to the 
subriemannian distance. But, as we have seen above, / is not Lipschitz continuous 
with respect to the Euclidean structure. Note that also the conditions of Theorem 
4.5 are satisfied with r = 0, and that the theorem implies the continuity of /. 
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Remark 4.4. Let us suppose that M is an n-dimensional subriemannian mani- 
fold with distribution H, and that k brackets are needed to express the gradient 
(gfr, • • • , gfs;) in terms of horizontal vectorfields, i.e. k is the smallest number such 
that H k = TM. For some r > k, let / : M — > R be a function such that 



/oFZf(rr) 



exists and is of class C r for all X G £i c,h(M). Then the gradient (J^r, • • • , ^) of / 
is of class C r ~ k , and therefore / is of class (7 r ~ fc+1 . However, the following theorem 
yields a stronger result, namely that / is of class C r no matter how many brackets 
are needed to span the tangent space. 

Theorem 4.5. Let H be a locally finitely generated, bracket generating distribution 
on a manifold M , and r > 0. If f : M — >■ R is a function such that 



Xf:M^R, (Xf)(x):=± 



foFlf{x) 



exists and is of class C r for all X G X/ OCj //(M) ; then f is also of class C r . 

Proof. We will show that / is of class C r at every x G M. This is a local question, 
so we can assume that conditions (a) to (d) of Section 3.3.3 are satisfied, so that 
the endpoint map is well-defined. 

(1) First, we will construct a smooth local section of the endpoint map. That is 
a smooth map u defined near x such that End XQ (u(x)) = x. By Lemma 3.64 
there is a normal control Uq such that End xo (uo) = Xq. By Theorem 3.67 End^ 
is open and the rank of End^ at uq equals the dimension of M, say n. End Xo has 
constant rank near uq, so by the Constant Rank Theorem 3.51 there is a subspace 
E C i/ 1 ([0, l];R fe ), an open neighbourhood U of u , an open neighbourhood V 
of x and charts 

4> : U C ^([0, l];R k ) ->■ <j>(U) cR n xE, 
ijj : V C M ->■ i){V) C R n 

such that for all (t 1 , . . . , t n , e) G 4>{U) we have 

^oEnd :co o0(t 1 ,...,r,e) = (t 1 ,...,r). 

Without loss of generality we can assume that <f>(uo) = 0. Then the map 



u : 



r\{{t\ ...X)-- (t\ ■ ■ ■ , t n , 0) G 4>{U)}) ^([0, 1]; 



has the desired properties. 

(2) Let T be the operator assigning to a control in the domain of End Xo the unique 
curve 7 with this control and 7(0) = Xq. By Theorem 3.57 Y is smooth as a 
map into C([0, 1];M). 
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(3) By Theorem 3.56 about superposition operators and because Xif is of class C r , 
the map 

C([0,1];M)->C([0,1];R), 1 ^{X i f)o 1 

is of class C r . 

(4) Let / denote the map 

^([O.p'jxCdO.l];!^!, (u,h)^ I V Ui (t)hi(t)dt. 

Jo 

I is smooth because it is linear and bounded. 

(5) From equation (*), using the notation X = (X±, . . . , Xk), we have 

f = f(x ) + I(u, (Xf)oTou). 
Therefore / is of class C r . □ 
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